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RANDOM RECURRENCE EQUATIONS AND RUIN 
IN A MARKOV-DEPENDENT STOCHASTIC 
ECONOMIC ENVIRONMENT 1 

By Jeffrey F. Collamore 

University of Copenhagen 

We develop sharp large deviation asymptotics for the probability 
of ruin in a Markov-dependent stochastic economic environment and 
study the extremes for some related Markovian processes which arise 
in financial and insurance mathematics, related to perpetuities and 
the ARCH(l) and GARCH(1, 1) time series models. Our results build 
upon work of Goldie [Ann. Appl. Probab. 1 (1991) 126-166], who has 
developed tail asymptotics applicable for independent sequences of 
random variables subject to a random recurrence equation. In con- 
trast, we adopt a general approach based on the theory of Harris 
recurrent Markov chains and the associated theory of nonnegative 
operators, and meanwhile develop certain recurrence properties for 
these operators under a nonstandard "Gartner-Ellis" assumption on 
the driving process. 

1. Introduction and summary. In a variety of problems in insurance 
mathematics and risk management, as well as other applied areas, it is rele- 
vant to study the tail probability of a random variable satisfying a stochastic 
recurrence equation. An example of this type arises in risk theory, where the 
objective is to characterize the probability of ruin of an insurance com- 
pany whose losses are governed by Lundberg's (1903) classical model, but 
where the company earns stochastic interest on its capital. In the setting 
of stochastic investments, the analysis of ruin departs substantially from 
the renewal-theoretic approach typically employed for Lundberg's original 
model. Instead, one introduces an associated process {W n }, defined below, 
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and observes that 

P{ruin} = Pjsup W n > u\, 

where u is the initial capital of the company and W := sup n W n satisfies the 
random recurrence equation 

(1.1) W = B + Amax{0,W} 

for certain random variables A (associated with the investment process) 
and B (associated with the insurance business); see Section 2.1 below. The 
characterization of ruin then centers around (1.1) and, in particular, the 
tail decay of its solution as u — > oo. In this setting, it is known that the 
probability of ruin decays at a certain polynomial rate, namely, 

(1.2) F{W>u}~Cu- v asu^oo 

for constants C and r; see, for example, Goldie (1991), Nyrhinen (2001), 
and, for the continuous time case, Kalashnikov and Norberg (2002), 
Paulsen (2002), Pergamenshchikov and Zeitouny (2006) and 
Kluppelberg and Kostadinova (2008). Rough large deviation asymptotics in 
a general setting have also been developed in Nyrhinen (1999). 

Related recurrence equations arise, for example, in life insurance mathe- 
matics, where attention is focused on perpetuities, which describe the dis- 
counted future payments of a life insurance company; and in financial time 
series modeling, where it is relevant to describe the tail decay for the now- 
standard ARCH(l) and GARCH(1, 1) models, used to quantify the logarith- 
mic returns on an investment [cf. Engle (1982), Bollerslev (1986), 
Embrechts, Kluppelberg and Mikosch (1997)]. In these cases, the solution 
is obtained by solving a random recurrence equation closely related to (1.1), 
namely, 

(1.3) V = B + AV. 

In the case of perpetuities, the random variables A and B are once again 
determined by the investment and insurance processes, respectively, and V 
describes the future financial obligations of the company; see Section 2.3 
below. It is known — both for the case of perpetuities and for the ARCH(l) 
and GARCH(1, 1) financial models — that 

(1.4) F{V >u}~ Cu~ x as u -> oo 

for constants C and r [cf. Goldie (1991), Mikosch (2003), and, for a continous- 
time version, Carmona, Petit and Yor (2001)]. Other relevant results for per- 
petuities can be found, for example, in Dufresne (1990) and Cairns (1995). 
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These diverse problems are unified through (1.1) and (1.3), which state 
that the random variable of interest statisfies an equation of the general 
form 



for some real- valued random function Solutions to such recurrence equa- 
tions have been developed in Kesten (1973), Grincevicius (1975) and 
Grey (1994), and particularly Goldie (1991), who introduced an approach 
based on "implicit" renewal theory, which is widely applicable in the setting 
of (1.5). Based on the results of Goldie's paper, one readily obtains estimates 
such as (1.2) and (1.4), as well as various estimates relevant, for example, 
in queuing theory and other applied areas. 

If, however, the financial or insurance process arising above is Markov 
dependent, then the above approach breaks down and the situation is actu- 
ally quite different, and it is not possible to develop corresponding estimates 
to (1.2) and (1.4) based on the recurrence equations (1.1) and (1.3). Such 
extensions are nonetheless of considerable applied relevance. For example, 
the investment returns of an insurance company would not generally be in- 
dependent, nor could they be described as a Markov chain in finite state 
space. A realistic model would typically involve a state space which is, say, 
real-valued and therefore uncountable, and would have increments which are 
unbounded. This is the situation in many of the standard financial models, 
such as the ARMA, stochastic volatility and GARCH time series models. 

The objective of this article is to study (1.2) and (1.4) in a Markovian 
setting where the insurance and, especially, financial processes involved are 
driven by a Harris recurrent Markov chain. Based on the regeneration tech- 
nique of Athreya and Ney (1978) and Nummelin (1978), we shall show that 
recurrence relations similar to (1.1) and (1.3) can be obtained. [It should 
be noted that this approach differs markedly from known methods for sim- 
ilar problems; cf., e.g., Nyrhinen (2001), de Saporta (2005).] In a general 
Markovian setting, the analysis of these equations turns out, however, to be 
considerably more complicated than in the independent case, and a main 
aspect of our study will be centered upon the regularity properties of certain 
random quantities formed over the regeneration cycles of the Markov chain. 
In Theorem 4.2 below — a central result of this paper — we develop regular- 
ity properties closely related to geometric r-recurrence for the operator P t , 
where 



and P is the transition kernel of the underlying Markov chain and / a real- 
valued function. Geometric r-recurrence plays an important role in the large 
deviations theory for general Markov chains [cf. Ney and Nummelin (1987a, 



(1.5) 



Z = $(Z) 



P a (x,dy) 



:=e afiy) P(x,dy) Va 



1987b)]. 
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In particular, there has been much recent attention focused on establish- 
ing geometric recurrence for a given Markov transition kernel P, which is 
typically achieved by verifying a Lyapunov drift condition, namely, 



for some function V > 1, "small set" C and constants b < oo and p < 1 
[cf. Meyn and Tweedie (1993), Chapter 15]. However, validating a condition 
such as (D) with the operator P x in place of P — which would yield geometric 
c-recurrence and some extensions developed here — is generally much more 
difficult. In this paper, we propose an alternative approach, based on the in- 
troduction of an auxiliary "/i-function" and the verification of an associated 
"Gartner-Ellis" limit and minorization condition. These conditions provide 
an alternative to (D) which, at least in the context of our examples, can 
be verified somewhat more naturally. [An alternative approach has recently 
been introduced in Kontoyiannis and Meyn (2003, 2005). The relationship 
between their approach and ours is discussed in more detail in Section 2 be- 
low.] In our development, we shall rely heavily on the theory of nonnegative 
operators, as summarized in Nummelin (1984). 

We conclude by mentioning some recent work on Markov-driven perpetu- 
ities, as described in (1.4) above. An extension of (1.4) to finite state space 
Markov chains has recently been obtained in de Saporta (2005). Also, an ex- 
tension to functionals of continuous-time Markov processes has been given 
in Blanchet and Glynn (2005), but under a boundedness assumption on the 
functionals which is violated in the examples we consider here. Specifically, 
their assumptions in our problem would imply that the sequence {J27=i Ai} 
is bounded, leading to exponential rather than polynomial decay for the 
probability of ruin. In fact, our emphasis on unbounded processes {A n } will 
lead to the main technical difficulties which we shall encounter below. In 
contrast to both of these papers, our methods will be based on the regen- 
eration properties of the underlying Markov chain, and regeneration will be 
central to our approach here. 

An outline of this paper is as follows. In the next section, we give a 
more precise description of the ruin problem with stochastic investments, 
as introduced in (1.1), (1-2), and then turn to a Markovian formulation of 
this problem. Next, we describe the same Markovian formulation, but for 
perpetuities and the ARCH(l) and GARCH(1,1) financial models. Some 
examples are given in Section 3, proofs are given in Sections 4 and 5 and 
generalizations are briefly discussed in Section 6. 





RANDOM RECURRENCE EQUATIONS AND RUIN 5 
2. Statement of results. 

2.1. A description of the insurance risk model in the i.i.d. setting. We be- 
gin by recalling the classical Cramer-Lundberg model for the capital growth 
of an insurance company, namely, 

(2.1) Yt = u + ct-J2&, 

i=i 

where u is the initial capital of the company, c is the premiums income, {^} 
are the i.i.d. claims losses and {Nt} is a Poisson process, independent of 
which describes the occurrence times of the claims. These assumptions 
imply that {Yt} is a Levy process, with i.i.d. losses over unit intervals given 
by 

B n :=-(Y n -Y n ^), re =1,2,.... 

It is assumed that {Yt}t>o has a positive drift or, equivalently, that {B n } ne x + 
has a negative mean. 

We now depart from this model by introducing a financial process de- 
scribing the investment returns. Assume that the return rate during the nth 
discrete time interval is given by r n , and let Z n denote the total capital of 
the insurance company at time n. Then 

(2.2) Z n = (1 +r n )(Z n _i - B n ), n=l,2, ... and Z = u. 

[One could alternatively assume that Z n = (1 + r n )Z n _i — B n , and the anal- 
ysis would carry through without significant change.] Setting R n = 1 + r n 
and solving (2.2) recursively for Z n yields 

Z n = Rn{Z n -l — B n ) = R n R n _i(Z n _2 — -Bn-l) — RnB n 

(2.3) 

— • ■ • — (Rn ■ • ■ Ri)Zo — (Rn ■ ■ -Ri)Bi — • • • — R n B n . 

Assuming r n > —1 a.s. for all n, we may set A n := 1/(1 + r n ) to be the 
stochastic discount factor, and multiplying left- and right-hand sides of (2.3) 
by A± ■ ■ ■ A n yields 

(2.4) (A 1 ---A n )Z n = u-W n , 
where 

(2.5) W n := B x + A X B 2 + ■ ■ ■ + (A x ■ ■ ■ A n ^)B n , 

that is, W n represents the total discounted losses incurred at times 1 < i < n. 
Let 

T^ = supW„. 

n>l 

Then {Z n < 0, for some re} {W > u}, by (2.4), and hence the probabil- 
ity of ruin is given by 

(2.6) *(tt) :=F{W>u}. 
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2.2. The Markovian case and a statement of our results. Our next ob- 
jective is to generalize the above formulation so that it allows for Markov 
dependence. In the interest of simplicity we shall first assume that there 
is dependence only in the investment process {^4 n } ; and that the insurance 
process {B n } is i.i.d. and independent of {^4 n }- ln Section 6.1 below, we shall 
discuss a slight generalization which allows for Markov dependence also in 
the sequence {B n }. 

We begin, then, with a Markov chain in general state space, denoted by 
{X n }, and assume 

(2.7) logA n = f(X n ), 

where / : § — ► R and is typically unbounded. (We could equally well assume 
that the function / is random; see Remark 2.3 below.) 

We suppose that {X n } is time- homogeneous, taking values in a countably 
generated measurable state space (§,<S) with transition kernel P(x,E) and 
/c-step transition kernel P k := pp k ~ 1 for k > 1. Assume that {A n } is ape- 
riodic and irreducible with respect to a maximal irreducibility measure (p. 
[For the definitions and a further characterization of these conditions, see 
Nummelin (1984) or Meyn and Tweedie (1993).] As an additional regularity 
condition, we suppose that for 92-a.a. initial states x € §, the distribution 
of J27=i^°&Ai is spread out for all n > N(x), where N(x) is some positive 
integer. 

Under the assumption that {X n } is irreducible with respect to 99, there 
exists a minorization for {X n } [Nummelin (1984), Theorem 2.1]; namely, 

(Mo) 51 c {x)v(E) <P k {x,E) VxgS, EeS, 

for some k € Z+, 5 € (0, 1], a probability measure v on (S, S) and a "small 
set" C € S, where <p(C) > 0. Here we will work with a strengthening of this 
property, condition (SDT), which will be described below. 

Notation. 

5 n = log^iH \-log A n , ra=l,2,... and S = 0; 

S^ = h(X 1 ) + --- + h(X n ), n = l,2,..., 

for any measurable function h : S — ► [0, 00); 

A(a) =limsup-logE[e Q5n ] Va € R; 

A B (a) =logE[\B 1 \ a ] VaGR; 

A(a) =limsup-logE[e Q5 " +/3S ' (l,l) ] Va = (a,/3) G R 2 ; 

n^oo 71 

>Co^ = {a; G S : /i(x) < a} Va G R, for any function /i : § — ► R. 
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Also let lc denote the indicator function on C, and for any measure u, 
let suppf denote the support of v. (In the notation for A and A, we have 
suppressed the dependence on the initial state of the Markov chain. However, 
in Proposition 5.1 below, it will be shown that these quantities are actually 
independent of this initial state.) 

The functions A and A are the "Gartner-Ellis" limits arising in large 
deviation theory [cf. Dembo and Zeitouni (1998), Chapter 2]. Roughly, A 
can be equated to the spectral radius of the transform kernel 

(2.8) P a (x,E):= I e af ^P(x,dy) Vx € §, E E S, 

JE 

where / is given as in (2.7), and similarly for A; cf. de Acosta (1988), Section 
7. 

We turn now to some assumptions on the Markov-additive process {(X n ,S n ) : 
n = 0,l,...}. First note that, if the average return rate is positive, then 
1 + T\ > 1 "on average," and hence it is reasonable to expect that 

E n [Ax] ^E^Kl + n)- 1 ] e(0,l), 

where tt is the stationary probability measure of {X n }. By Ney and Num- 
melin (1987a), Lemmas 3.3 and 5.2 (and their proofs) and an application of 
Jensen's inequality, we then obtain under very weak regularity conditions 
on the Markov chain that, if A is finite in a neighborhood of zero, 

(2.9) A(0) =E^[logAi] <0. 
Hence 

(2.10) r:=sup{a:A(a)<0}>0. 

Moreover, if P^lAi > 1} > 0, then r < oo, so the solution to (2.10) is in fact 
a proper solution. 

In addition to the existence of the solution in (2.10), we will also need to 
assume a further regularity condition on A, roughly stating that it is finite in 
a neighborhood of r, and that for some choice of h, the generating function 
of {h(X n )} is sufficiently well behaved around the origin. To motivate this 
condition, set 

n 

h(x) = \f(x)\ and S- = ]T(-/(X 4 ))V0 

i=l 

and observe by Holder's inequality that 

(2.11) E [ e «5„+/3si h) ] < (E[e p(a+/3)5 ' l ]) 1/p (E[e 29/3S " ]) 1/<? , 
where p~ 1 + q^ 1 = 1. Then 

(2.12) A((a,/3))<-A(a) + -A(-e), 

P Q 
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where a = p(a + (3) and e = 2qf3. Choosing (a, (3) sufficiently close to (r, 0), 
then p sufficiently close to one, and finally letting (3 \ 0, we conclude that 
if A is finite in a neighborhood of the interval [0, r] , then 

(2.13) A((a,/?))<oo for some a > r and j3 > 0. 

More generally, if we take h to be an arbitrary function, then in place of 
(2.11) we obtain 

(2.14) E[e aSn+ps " h) ] < (E[e paSn ]) 1/p (E[e ql3S " h) ]) 1/q 

and hence (2.13) still holds, provided that A is finite in a neighborhood of 
r and the generating function of {Sn } is finite in a neighborhood of zero. 
Hence, we may also choose h{x) = \\x\\ or a more slowly increasing function, 
such as log 1 1 a: || V 0. Later, we will relate the function h to the minorization 
condition (971), given below, and for this reason it will often be necessary to 
choose h to be different from |/|. This is because we will need h(x) to tend 
to infinity as ||x|| /* oo in order for (9Jt) to be satisfied. See, for example, 
Example 3.3 below. (Also see Example 3.1 for another case where we would 
generally not choose h = |/|.) 

The above considerations provide motivation for the following. 



Hypotheses. 
(Hi) t£(0,oc). 

(H2) There exists a function h : § — > R and points a > r and (5 > such that 
A((a,/3)) < 00 and A#(a) < 00. 

Next we introduce a further condition which regulates the behavior of 
{A n } and {A^ n } on the level sets of the function h appearing in (H2). To 
motivate this condition, note that for a very large class of financial processes, 
there is some form of stochastic domination. A simple example of this type 
is the AR(1) process, 

(2.15) X n = cX n ^x + Cn, n = l,2,... and X = x, 

for |c| G (0,1) and {Cn} an i.i.d. sequence of standard Gaussian random 
variables. Then 

Xq = z <J=4> X\ ~ Normal(cz, 1). 
Consequently, if c> then 



(2.16) x<y =^> P(x,E) <P(y,E) for all sets E C 



(x + ?/),oo^; 
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and conversely, 

x<y => P(y,E) <P(x,E) 

(2 ' 17) 

for all sets E C f — oo, -(a; + y) , 

where P denotes the Markov transition kernel of {X n }. Therefore, if C a h C 
[—b, b] then 

(2.18) P(x,E)<P(-b,E)+P(b,E) for all x G £ a /i and E £ S. 

If c < 0, then the inequalities on the right-hand sides of (2.16) and (2.17) are 
reversed, but (2.18) remains valid. This type of reasoning can be generalized 
to include, for example, general ARMA(p, q) models by using a vector rep- 
resentation for the process {^ n }; cf. Meyn and Tweedie (1993), Chapter 2 
and Section 6 below. More complicated financial processes can be handled 
similarly. 

Of course, a representation such as (2.18) would be quite meaningless if it 
were only to hold for the individual points b and —b, which have c/2-measure 
zero, whereas (2.18) actually holds for every b > b. For this reason, in (2.18) 
it is natural to substitute sets of positive ^-measure for the individual points 
—b and 6; and in this way we arrive at the following general condition. 

(H3) For any a > 0, there exist ^-positive sets E\, . . . , E\ C S, possibly de- 
pendent on a, and a finite constant D a such that 

P{x,E) < D a \ni\^P{x u E) -. Xi £ Ei,l <i Vx 6 C a h, EeS. 

Finally, we introduce a strengthening of the minorization (.Mo) described 
above. 

Minorization (971) . For any a > sufficiently large, there exists a con- 
stant 5 a > and a probability measure v a on (§,5) with v a (C a h) > such 
that 

(M) 6 a l Cah (x>a(E)<P(x,E) VxeS,£eS. 

It should be emphasized that the function h in (M) is the same function 
as that appearing in (H2). Thus, while (H2) benefits from a small choice of 
h, the minorization (M) benefits from a large choice of h and, in practice, 
a balance is needed in selecting a proper choice for this function. 

Remark 2.1. There is no loss of generality in assuming that suppf a C 
C a hf] Cbf, where b € R is arbitrarily large, since we may always truncate 
the measure u a in (Ai) and the minorization will still hold. In the sequel, it 
will always be assumed that v a has been chosen in this manner. 
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Remark 2.2. For simplicity, we have taken k = 1 in (M) [compare 
(A4o)], which may be restrictive in certain examples. A generalization to 
the case where k > 1 will be discussed below in Section 6.2. 

To see how (971) relates to some more standard conditions, suppose for 
the moment that {A n } is uniformly recurrent, that is, 

Su (E)<P(x,E)<du (E) 

Vx G S, E G 5, for some probability measure fo- 

In this case, the C-set in (Mo) may be taken to be the entire state space, 
which means that (9Jt) then holds with h = l. Incidentally, from (9t) we also 
obtain 

(2.19) P(x,E) <^P(x ,E) Vx G S, £ G 5, for any x G §. 

Consequently (H3) holds. Thus, in this setting, our conditions reduce essen- 
tially to (H2) with h = l, namely, we require that A(a) < 00 and A#(a) < 00, 
for some a > r. 

In the examples below, the upper and lower bounds in (9t) will not be 
satisfied, but in place of the lower bound we will have 

(M') S j l Cj (x)u j (E)<P(x,E) VxGS, EeS, j GZ+, 

along an appropriately chosen sequence of sets Cj /" S. Then an unbounded 
function h may essentially always be found which satisfies condition (3JI), 
although this imposes an additional constraint on (H2) as compared with the 
case h = l. For example, a typical choice for h would be to take h(x) = \\x\\, 
in which case the level sets C a h would tend to § as a — > 00. In essence, then, 
(SDT) requires that the minorization (A^o) hold for an arbitrarily large choice 
of C, but it is not required that C = S. In particular, our assumptions lie 
somewhere between the minorization condition (Mo), which is essentially 
always valid, and the much stronger condition that the Markov chain be 
uniformly recurrent. Nonetheless, these conditions are flexible enough to 
handle some reasonably complicated financial models, as will be seen in 
Section 3 below. 

We note that if P were replaced with P kj on the right-hand side of (M'), 
then this condition would indeed be an exceedingly mild requirement [cf. 
Meyn and Tweedie (1993), Proposition 5.2.4]. 

Theorem 2.1. Let {(X n ,S n ) : n = 0, 1, . . .} be a Markov- additive process 
and let h:S — > [0, 00) be a function satisfying conditions (Hi)-(H3) and (OR)- 
Then for ip-a.a. x G S, 

(2.20) lim u^Ju) = C, 

u— >oo 

where C G [0, oo), and x E (0, oo) is given as in (2.10). 
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Remark 2.3. As an extension, one could suppose that the function / 
in (2.7) is random. In Ney and Nummelin (1987a, 1987b), Markov-additive 
processes are studied which take the general form S n = £i H h £ n , where 

P{(Xn,Zn) G E xT|5n-i} = P{(X n ,£ n ) € E x T|X n _i} = / P(x, dy)Q(y ,T) 

JE 

for some family of probability measures {Q(x,r) :x € S, T € 72.}, where 72 
denotes the Borel cr-algebra on K and 3n = c{Ao, . . . ,X n ,£i, . . . ,£ n }- Thus, 
with a slight abuse of notation, we may write £ n = f n (X n ), where {f n {x) : x 6 
S, n = 1,2,...} is a family of independent random variables, also independent 
of {A n }, whose elements have, for fixed x, a common distribution function. 
But then {(X n ,£ n )} is itself a Markov chain, which inherits a minoriza- 
tion from {A„,}, and clearly £ n = f(X n ,£ n ) for the deterministic function 
f(x,y) = y. In short, the introduction of a random function in (2.7) does 
not lead to additional generality, at least in principle, and the previous the- 
orem could also have been phrased at that level of generality. 

Remark 2.4. A precise representation for the constant C can be as- 
certained from the proof in Section 4. Under very weak conditions, it can 
be shown that this constant is positive. However, we will not explore the 
precise conditions here. 

2.3. Further remarks on our hypotheses. Before turning to our next re- 
sult we would first like to comment, briefly, on the comparison of our ap- 
proach to some other methods in the literature. 

An alternative approach would be to replace (H2) and (H3) with a weaker 
Lyapunov drift condition, namely, 

(Si) / V(yY^P(x, dy) < pV(x) + bl c (x), 

where C is a small set, say, and p < 1. Indeed, we shall actually utilize (H2) 
and (H3) together with the inherent eigenvalue and eigenfunction properties 
of the chain to deduce 

(2) 2 ) / V(y)e vfiy) P{x,dy) <pV{x) Vx £ S, 

Jc c 

where C = C a h is the small set in (M), and this is roughly equivalent to 
(£>i). [More precisely, if r = then either condition may be used to obtain 
geometric recurrence; cf. Nummelin (1984), Meyn and Tweedie (1993). It 
should be remarked that we will actually establish and apply a condition 
slightly stronger that (2)2)-] In this paper, the deduction of (D2) will be 
obtained by an indirect argument, which will constitute an important part 
of the proof of Theorem 4.2 below. 



12 



J. F. COLLAMORE 



Nonetheless, (£>i) could also be viewed as a possible starting point for our 
results, and this approach has recently been followed, for example, by Chan 
and Lai (2007) and Balaji and Meyn (2000). For countable state space chains 
Balaji and Meyn (2000) have shown that (Si) is equivalent to geometric 
recurrence for the r-shifted chain. This r-shifted chain will also appear in our 
analysis, in Section 5, and in Theorem 4.2 we shall develop similar recurrence 
properties, although our methods and the exact statement of our results 
will be quite different. Specifically, instead of geometric recurrence, we shall 
establish certain related moment properties. The connection between these 
two notions will be explained in more detail in Section 5. 

In a comprehensive study, Kontoyiannis and Meyn (2003, 2005) have con- 
sidered certain extensions of Balaji and Meyn (2000) to general state space 
chains. Specifically, they have developed multiplicative mean ergodic theory 
and its connection with the multiplicative Poisson equation. These results 
relate closely to the existence and characterization of the eigenvalues and 
eigenfunctions associated with the kernel P a . We refer the reader to Kon- 
toyiannis and Meyn (2003, 2005), where a survey of some other related 
results can also be found. 

In the context of our examples, it often seems more natural to verify (H2) 
and (H3) than a condition such as (2)i), assuming that the driving Markov 
chain has finite exponential moments around the origin and therefore fits 
within the framework typically studied in modern large deviations theory. 
As mentioned in the previous section, a sufficient condition for (H2) to hold 
is the finiteness of the Gartner-Ellis limit, A, in a neighborhood of r, and 
the finiteness of an associated limit for {Sn} in a neighborhood of zero, 
where necessarily A(r) < 00 if the conclusions of our main results are to 
hold. While it is often difficult to obtain closed-form expessions for Gartner- 
Ellis limits, their finiteness can frequently be verified by indirect means. In 
Example 3.3 below, finiteness is obtained along the positive axis due to the 
boundedness from above of the function /; whereas in this example, it would 
not be transparent that any function V should satisfy (£>i). (This distinction 
would be even more striking if the interest process in that example were 
taken to be stochastic.) In any case, our approach exposes an interesting 
interplay between the Gartner-Ellis limit of large deviations theory and 
geometric recurrence of the kernel P a , and the latter property has important 
implications in the large deviations theory for Markov chains; cf. Ney and 
Nummelin (1987a, 1987b). 

2.4. Perpetuities and the GARCH(1,1) process. A related but simpler 
problem to the one considered in Section 2.2 is the study of perpetuities. 
Assume for the moment that {(A n , B n ) : n = 1, 2, . . .} is an i.i.d. sequence of 
random variables, and consider the tail of linin^oo W n where, as before, 



(2.21) 



W n = B 1 + A X B 2 + . . . + (Ai • ■ ■ A n . x )B,, 
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In the context of life insurance mathematics, the sequence {B n } typically 
denotes the future payments from an insurance company to its policy holders 
(or vice versa) at times n = 1, 2, . . . , while {A n } denotes the discount factors 
associated with the investment returns, that is, A n = (1 + r n ) _1 , where r n 
is the return rate at time n. Here, both the processes {A n } and {B n } are 
assumed to be random. Then (A\ ■ ■ ■ A n -i)B n denotes the amount of capital 
which needs to be set aside to cover payments at time n to the policy holders, 
and 

(2.22) := lim W n 

represents the company's total future financial commitment. Under our hy- 
potheses, the a.s. existence of the limit (2.22) follows from Goldie and Mailer 
(2000), Theorem 2.1. 

An analogous mathematical problem arises when characterizing the ex- 
tremal behavior of the ARCH(l) and GARCH(1,1) financial time series 
models. In the GARCH(1,1) model, the logarithmic return of a stock at 
time n, denoted i?* , is governed by the system of equations 

(2.23) R*=o- n £ n , where o 2 n = a + hcr 2 l - 1 + ai{R* n _i) 2 , n = 1,2,. .. , 

for {£ n } an i.i.d. sequence of standard Gaussian random variables, where 
ao,ai and b\ are positive constants. Setting W* = o\ gives 

(2.24) W* = A n W*_ 1 + B n , ri=l,2,..., and Wq =j/GK, 
where A n = b\ + ai£n-i an d B n = oo- Solving (2.24) yields 

(2.25) W* = (A n --- A{)W5 + (A n --- A 2 )B l + ■■■ + A n B n ^ + B n , 

which has a similar, although not identical, form to (2.21). Then it is of 
interest to study the tail of := limj^oo W*. [Here we consider the limit 
in law. The existence of the limit distribution is then guaranteed by Goldie 
and Mailer (2000), Theorem 3.1.] For a more detailed description of the 
ARCH(l) and GARCH(1,1) financial time series models, see, for example, 
Embrechts, Kliippelberg and Mikosch (1997) or Mikosch (2003). What the 
random variables and W£o have in common is that they both satisfy 
the random recurrence equation 

(2.26) V = B + AV, 

where (A,B) = (A 1 ,B 1 ). 

It is of theoretical and applied interest to consider (2.21) and (2.25) in 
a setting where the process {A n } represents a general Markov-dependent 
sequence of random variables. Let 

^ x (u)=F{W oo >u\X = x} and %(u) = F{W^ > u\X = x). 

If we adopt the same assumptions as in Section 2.2, then as a natural variant 
of Theorem 2.1 we obtain the following. 
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Theorem 2.2. Let {(X n ,S n ) : n = 0, 1, . . .} be a Markov-additive process 
and let h:E> — > [0, oo) be a function satisfying conditions (Hi)— (H3) and (SJt). 
Then for if- a. a. x, 

(2.27) lim u x WJu) = C, 

u — >oo 

where C 6 [0, 00), andxG (0, 00) is given as in (2.10). Moreover, (2.27) also 
holds if^ x (u) is replaced with ^*(u). 

3. Examples. The objective of this section is to relate our theorems and 
conditions to some standard processes arising in insurance and financial 
mathematics. 

From a mathematical point of view, it should first be noted that if the 
Markov chain has finite state space or is uniformly recurrent, then our con- 
ditions hold without further restrictions on the Markov chain. [However, in 
that setting, the proofs of our main results could be simplified considerably.] 
Our primary objective is to consider the case of general, Harris recurrent 
chains, and this setting is indeed realistic from a financial perspective, as the 
most reasonable models often involve a more intricate dependence structure 
than can be described with, say, a finite-state chain. Nonetheless, we will 
begin with the finite-state case and first illustrate our conditions in that 
context before turning to some more complicated examples. 

Example 3.1. Assume that an insurance company receives premiums 
and incurs claims according to the classical Cramer-Lundberg model de- 
scribed in Section 2.1; thus the one-period losses, {B n }, form an i.i.d. se- 
quence of random variables. Suppose that the company invests its excess 
capital and that the investment returns are governed by the standard Black- 
Scholes model, but modified to allow for regime switching. More precisely, 
assume that the investment returns follow the stochastic differential equa- 
tion 

1 

(3.1) dS t = J2 S t(Vi dt + a tfWi)l { * LtJ=i} , 

i=0 

where {X n } is a discrete-time Markov chain taking values in {0,1}, {Wt} 
is standard Brownian motion, ^ and o~i are constants for each i, and it is 
assumed that the processes {Wt} and {X n } are independent. Here the mo- 
tivation is that, under different external conditions, represented by "states," 
the returns on the investments change. Integration of (3.1) then yields 



(3.2) 



A n :=^ = l {Xn=0} A^ + l {Xn=1} AW, 

<->n 
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where 

(3.3) A® = expj- La " y) " ^n}, {Z n } ~ i.i-d. Normal(0, 1). 

Regime-switching models have been introduced, for example, in Hamil- 
ton (1989), and have subsequently appeared rather widely in the literature; 
see, for example, Hardy (2001). (Such models could also allow for depen- 
dence between the underlying Markov chain and the conditional returns on 
the investments.) 

A slightly more general model than (3.2) would be to assume that {A n } 
is modulated by a fc-state Markov chain and that the discounted returns, 
conditional on {A n }, are independent [but not necessarily dictated by the 
distribution described in (3.3)]. Namely, assume 

(3-4) A n = Y,l {Xn=i} A^\ 

i=0 

where {(^4n°^ , • • • , An is an i.i.d. sequence of random variables and {X n } 
is a Markov chain on {0, 1, . . . , k — 1}. 

In this setting, it is easy to verify our conditions based on the observation 
that a finite-state Markov chain satisfies a condition tantamount to uniform 
recurrence, namely, 

(9t) 6u (E)<P(x,E)<du (E) VxeS, EeS, 

for some probability measure uq and certain positive constants 5 and d. 
Therefore, as already noted in the discussion following Remark 2.2, we im- 
mediately obtain (H3), and we obtain (Mo) with k = 1 and C = S, that 
is, we may choose C to be the entire state space of the Markov chain. Be- 
cause (Mo) holds with C = S, we consequently conclude that (9Jt) holds 
with h = 1. Choosing h = 1, we see that a sufficient condition for (H2) to 
hold is that A is finite in a neighborhood of the set {a:A(a) < 0}, which 
is a rather weak requirement. Finally, it is well known that (2.9) holds un- 
der (JH) [cf. Iscoe, Ney and Nummelin (1985)], and thus (Hi) holds provided 
that E^flog Ai] < 0, where ir is the stationary measure of the Markov chain. 

The above reasoning also applies in general state space if the underlying 
Markov chain is uniformly recurrent, but this reasoning will fail in the ab- 
sence of uniform recurrence. Specifically, in that case we will not be able to 
choose h = l. 

For a discussion of uniformly recurrent chains see, for example, 
Iscoe, Ney and Nummelin (1985) and references therein. 

Example 3.2. Assume again the Cramer-Lundberg model for the in- 
surance business, but now assume that the discounted logarithmic invest- 
ment returns are modeled as an AR(1) process with negative drift; more 
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precisely, 

(3.5) logA n = X n - fi, 

where [i > and {A n } satisfies (2.15). Then all of our hypotheses hold with 
h = |/|, provided that As (a) < oo for some a > r. The Markov chain {A n } 
is not uniformly recurrent, but the minorization (SDT) and other conditions 
are easily verified, as follows. 

Let P denote the transition kernel of {A" n }, and let $> x denote the Normal(x, 1) 
density function. For any fixed a > 0, set 

$a(y) = mf{$ cx (y) : \x\ < a} = min $ cx (y) Vy G R. 

x£{— a,a} 

Then 

(3-6) I <ta(y)dy<P(x,E) Vx G {x G S : \x\ < a}, E G S. 

Je 

Hence (M) holds with v a (dy) = b§?_ a (y) dy and b G (0, oo) a normalizing con- 
stant. 

To verify the remaining conditions, first compute the cumulant generating 
function, 

1 ^o? 
(3.7) A(a) =limsup-logE[e" 5n ] = -am-\ — Va, 

n— >oo Ti 2 

where m and <r are positive constants. The form of the function on the right- 
hand side is obtained by observing that S n := log^4i + • • • + logA n is clearly 
normally distributed, and so the computation of the limit in (3.7) reduces 
to calculating its limiting normalized mean and variance. Now A is finite 
everywhere and we have chosen h= \f\. Consequently the general remarks 
in Section 2 may be applied to obtain (Hi), (H2) and (H3). 

If {AT n } is an AR(p) process with p > 1, then {X n } is not itself a Markov 
chain, but instead we may consider the Markov chain X n = (X np , . . . , A^ n (p-i)+i)- 
The Markov chain {AT n } does not satisfy the minorization (9JT) [since we 
would need to take k > 1 in (.Mo)]- However, the chain {X n } does satisfy 
the minorization {Mi) presented in Section 6.2 below, and hence the results 
of this paper may still be applied. For further discussion of this case and the 
verification of our conditions here, see Section 6.2 below. Finally, if {A^} is 
a general ARMA(p, q) process, then a slight modification of our assumptions 
is necessary, but the basic approach still applies, in essence; see Section 6.2 
for details. 

Financial modeling by means of ARMA processes is now quite standard. 
In an insurance context, this forms the basis for the so-called Wilkie model; 
for an introduction in this setting, see Panjer (1998). A general introduc- 
tion to ARMA models and their applications can be found, for example, in 
Box and Jenkins (1976) or Brockwell and Davis (1991). 
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Example 3.3. Assume once again the Cramer-Lundberg model for the 
insurance business, but assume that the investments are split between a 
fixed proportion in a stock and bank, respectively. Suppose that the bank 
investment earns interest at a constant rate, say r, while the logarithmic 
stock returns, {-R^}, are modeled according to a stochastic volatility model, 
namely i?* = cr n ^ n , where is an i.i.d. Gaussian sequence of random vari- 
ables, and {log<7 n } is rather arbitrary and may, for example, be taken to be 
an ARMA(p, q) process, independent of {£n}- Such a choice for {logcr n } is 
fairly typical, and with this choice, the statistical properties of {R^} become 
mathematically tractable, leading to their popularity as an alternative to, 
say, GARCH models. For a futher description of these models and their sta- 
tistical properties see, for example, Mikosch (2003) and Davis and Mikosch 
(2008a, 2008b). Recent developments can also be found, for example, in 
Shephard (2005). 

Under the above assumptions, 

(3.8) A n = (p(l + r) + (l-p)e R ^y 1 where R* n = <r n £ n , 

for some constant p € (0, 1), and thus A n is deterministically bounded from 
above, uniformly in n. 

In this example, the Markov chain is {A n } = {(log<7 n ,£ n )} C M 2 , and we 
may take h(x) = \\x\\ for all x € M 2 . Note, in particular, that we would not 
want to choose h = \ f\ since: (i) in order for (971) to be satisfied, we would 
need h to tend to infinity when ||A n || /" oo; and (ii) in order for (H2) to 
be satisfied, we would need A to be finite in a neighborhood of zero, but 
{f(X n )} has heavy tails along the negative axis. Thus, h = \f\ is not a 
suitable choice in this case. 

Now, although the Markov chain {(logcr n , £ n )} is two-dimensional, the 
analysis of it simplifies considerably due to the fact that {£ n } is i.i.d. and 
{log a n } is independent of {£ n }- Indeed, since E[e Q ^] < 00 for all a, and 
since the ^4-sequence is deterministically bounded from above, it is actually 
sufficient to verify (9Jt), (H2) and (H3) for the process {logcx n } [using that 
h(x) := ||x|| < \xi\ + \x2\ in the verification of (H2), and using (2.14) in place 
of (2.11) for this verification]. Moreover, since {logcr n } is assumed to be 
an AR(1) process, these properties can be obtained just as in the previous 
example, while more general ARMA processes may also be considered, as 
discussed in Section 6.2 below. 

The verification of (Hi) is, however, somewhat more complicated than 
in the previous example. Since the function A is actually infinite along the 
negative axis, the reasoning leading to (2.10) — which gave the positivity of 
r in (Hi) — is no longer applicable. To circumvent this difficulty, introduce 
the left-truncated sums 

5 (M) = / (M) (Xi) + ... +/ (M) (Xn); 

where f^ M \x) = f(x) V —M and M / 00. 
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Then si M) > S n for all n and M, and therefore A( M )(a) > A(a) for all a > 0, 

where A( M ) is defined the same way as A, but with {si^} in place of {S n }. 
Then 'K 7r [f^ M \X)] < for large M, and consequently the reasoning leading 

to (2.10) applies to {S { n M) } and yields (9( A/ ))'(0) < 0, for large M, where 

exp(-6( M) )is the convergence parameter associated with {S^^} [as defined 
in Nummelin (1984), page 27; see also Section 5 below]. In Proposition 5.1 
and Remark 5.1 below, we will show that under (971), (H2) and (H3), the 
function 9( M ) is convex and 

6 (A/) (t A/ ) = A {M) (t M ) = 0, where x M = sup{a : K {M \a) < 0}. 

Therefore, (6( M ))'(0) < => t M > 0. Since A( M ) > A for all a > 0, we con- 
sequently obtain (Hi). 

Example 3.4. Consider a G ARCH (1,1) process with Markov regime 
switching, namely, 

(3.9) R* n = a n ^ n , where al = a + b 1 al_ 1 +a 1 (R^ l _ 1 ) 2 , n = 1,2,..., 

and where (ao,ai,&i) is now viewed as a random vector which will typi- 
cally oscillate between a finite number of states. As before, {£„} is an i.i.d. 
sequence of Gaussian random variables. [The following discussion applies 
equally well for an ARCH(l) process in place of a GARCH(1,1) process.] 
Such models have been studied rather extensively; for an introduction, see 
Lange and Rahbek (2008) and references therein. One motivation for con- 
sidering this type of dependence is that it yields some of the statistical 
properties of long-range dependence. This viewpoint has been introduced in 
Mikosch and Starica (2004). In any case, it is reasonable to assume that the 
parameters (ao,ai,&i) will change over time due, for example, to external 
economic factors, and this gives intuitive motivation for the dependence in 
(a ,ai,bi). 

Under Markov regime switching, it is usually assumed that the observed 
parameter values at time n are given by 

a Xn :=(a (X n ),a 1 (X n ),6 1 (X n ))€M 3 , 

where {^n} is a /c-state Markov chain which is exogeneous and hence un- 
observed. Given the transition matrix of the finite-state chain, the extremes 
of this econometric process can then be analyzed by our methods, just as in 
Example 3.1. 

An interesting variant would be to assume that is an ARMA process, 
driven by certain observed economic factors such as, for example, traded 
volume in the market, and then the extremes may be analyzed according to 
the methods in Example 3.2. It should be remarked in this example that if 
both (a\,bi) and ao depend on {X n }, then one needs a slight modification 
of the formulation of Section 2, as will be discussed below in Section 6.1. 
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To summarize, our conditions hold for a wide class of financial processes, 
such as regime-switching models, ARMA models and certain stochastic 
volatility models. Here we have developed the connection especially with 
ARMA models and with more complicated models which can be constructed 
from these, but it should be emphasized that our conditions are actually 
quite general. Indeed, our primary assumption is that the underlying Markov 
chain is light-tailed and therefore satisfies the usual moment conditions of 
modern large deviations theory, and for such processes, our results hold un- 
der very weak regularity conditions. It should, moreover, be pointed out 
that the investment process in Example 3.3 is actually not light-tailed — it is 
the driving Markovian process which is light-tailed, and for this reason our 
conditions can still be verified there. 

Finally, we mention that the recurrence equations studied here are also 
broadly relevant for a wide class of related processes arising outside of 
the areas of financial and insurance mathematics. For some recent results 
along these lines see, for example, Gnedin (2007) or the survey article of 
Aldous and Bandyopadhyay (2005). Applications in the direction of com- 
puter science can be found, for example, in Neininger and Riischendorf (2005) 
and references therein. 

4. Proofs of the main theorems. 

4.1. Sketch of the proofs. We begin with a brief sketch of the main ideas. 
Our starting point is the well-known regeneration lemma of Athreya and Ney 
(1978) and Nummelin (1978). This lemma asserts the existence of a sequence 
of random times, To, T\, T2, ■ ■ ■ , such that the blocks 

{XT i - 1 ,---,XT i -l}, Z = 0, 1, . . . , 

are independent for i > and identically distributed for i > 1 (where T!_i = 
1). Thus, in particular, the increments 

(4.1) S Ti -x - St^-i ■= f{X Ti _,) + ■■■ + f(X Ti -i) 

are independent for i > and identically distributed for i > 1. However, it is 
not immediately evident that a similar independence structure should exist 
for the risk process 

(4.2) W n := B x + A X B 2 + ■ ■ ■ + {A x ■ ■■A n _ 1 )B n , 

which is our primary object of study. In Lemma 4.2 below, we shall show 
that 

(4.3) W n = B + i £i + • • • + (io • • • Ak^Bk* + (i • • • A K *)Tl n , 

where {(Ai,B{) : i = 0, 1, . . .} is an independent sequence of random variables 
which, for i > 1, is identically distributed, 7Z n is a negligible remainder term, 
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and K*(n) f oo as n — > oo. It follows as a consequence of (4.3) that W := 
sup n W n "nearly" satisfies the random recurrence equation 

(4.4) W = B + max{M - B, A\V} 

for some random variable M, where (A, B) = (Ai,Bi) for i > 1. [A more pre- 
cise statement will be given in Lemma 4.2.] The random recurrence equation 

(4.4) is very similar in form to (1.1), which applied in the i.i.d. case. 

In Section 2 we argued that the probability of ruin is given by ip(u) := 
¥{W > u}. Thus, to determine this probability, we need to find the tail 
distribution of W. To this end, we apply a result of Goldie (1991), which 
states that under appropriate conditions, (4.4) implies 

(4.5) lim u v P{W > u} = D 

u — »oo 

for certain constants D and rj. Equation (4.5) provides a complete solution 
to our problem, but it remains to check that Goldie's conditions are actually 
satisfied and to identify the constant rj (which will be shown to equal r in 
the proof of Theorem 2.1). 

It is, in fact, quite challenging to verify that Goldie's conditions actually 
hold. To do so, we shall need to establish certain moment conditions for the 
random variables A,B,M appearing in (4.4); in particular, 

E[A a ] < oo, E[|B| a ] < oo and E[|M| Q ] < oo 

(4.6) 

for some a > r; 

see Theorem 4.2 below. The simplest of these studies A a = exp{a(£V;_i — 
Sr i _ 1 )}; cf. (4.1). After a change of measure it can be shown that, roughly 
speaking, 

(4.7) E[A a ] ^ E Q [e £{T ^ T ^ l) ] for some e > 0, 

for an a-shifted kernel Q, and thus we see that such moment conditions 
are closely related to geometric recurrence for the a-shifted chain. The ran- 
dom variables \B\ a and \M\ a are more complicated, but can be handled by 
somewhat similar techniques. 
We now proceed more formally. 

4.2. Formal proofs of Theorems 2.1 and 2.2. We begin with a precise 
statement of the regeneration lemma, first established by Athreya and Ney 
(1978) and Nummelin (1978). Here and in the following, set T_i = 1 and let 

r = Tj — Tj_i (i > 1) denote a typical regeneration time. 

Lemma 4.1. Assume (Mo) holds with k = l. Then there exists a se- 
quence of random times, < Tq < T\ < ■ ■ such that: 
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(i) To , T\ — Tq , T2 — T\ , . . . are finite a. s. and mutually independent; 

(ii) the sequence {T — Tj_i : i = 1, 2, . . .} is i.i.d.; 

(iii) the random blocks {Xx i _ 1 , • • • , X^-i} are independent, i = 0, 1, . . . ; 

(iv) P{X Ti € £|$ Ti -i} = v{E), for allE e 5. 

Remark 4.1. The regeneration times {T}j>o of Lemma 4.1 can be re- 
lated to the return times of {X n } to the set C in (M.q), as follows. First 
introduce an augmented chain {(X n ,Y n )}, where {Y n :n = 0, 1, . . .} is an 
i.i.d. sequence of Bernoulli random variables, independent of {X n }, with 
¥{Y n = 1} = 5, where 5 £ (0, 1] is given as in {Mq). Then T — 1 can be iden- 
tified as the (i + l)th return time of {(X n ,Y n )} to the set C x {1}. At the 
subsequent time, T, the random variable X^ has the distribution v given 
in (.Mo)> independent of the past history of the Markov chain. 

To apply the lemma in the context of our problem, assume now that ( M. ) 
holds, and let Tq,T±, . . . denote the resulting regeneration times. Then define 
the following random quantities formed over the independent random blocks 
of the previous lemma: 

A i = A Ti _ 1 ---A Ti _ 1 , a = 0,1,...; 

Bi = B Ti _ x + A Ti _ x B T ._ x ^ + ■■■ + {A Ti _ x ■ ■ ■ A Ti _ 2 )B T .^, i = 0, 1, . . . ; 
Mi = suplBr^ + At^Bt^+i + ■■■ 

+ {At,., ■ ■ ■ Aj^Bj : H-i <j< Ti}, i = 0, 1, . . . . 

By Lemma 4.1, {(Ai, Bi, Mi) :i = 1,2,...} is an i.i.d. sequence of random 
vectors which is also independent of (Aq, Bq, Mq). 

Our objective is to study the tail behavior of the random variable W 
defined in Section 2.1. Let be defined in the same as W, but under 
the assumption that To = 1, that is, under the assumption that regenera- 
tion occurs at time one. Also let (A,B,M) = (Ai, B±,Mi), and assume that 
(A,B,M) is independent oi {(A i ,B i ,M i ):i = 0,l,. . .}. 

We begin by establishing the following. 

Lemma 4.2. = B + max{M - B, A\V^} and W = B + max{M - 

B ,A W*}. 

Proof. For n > 1, set 

K* = K*(n) := M{i : T > n} - 1. 

Then for any n > 1, 

W n = {B l + A X B 2 + ■ ■ ■ + {A x ■ ■ ■ Azb-2)Bzb-i) 
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+ (Ai ■ ■■A To -i){B To + h (Aj> • • ■ A Tl - 2 )B Tl -i) 

+ •• • + (Ai • ••^_ 1 _ 1 )(S T ^_ 1 + • • • + (Ar^.! • ■■A Tk ^ 2 )B Tk ^_ 1 ) 
+ {A 1 ---A TK ^ 1 ){B TK ,+--- + {A TK ,---A n _ 1 )B n ). 

Hence 

(4.8) W n = B + i ^i + • • • + (i • • • Ak^^Bk* + (I ■ • ■ i*. )ftn, 
where 

7^ := B Tk , + A Tk , B Tk ,+i + ■■■ + (At k * • • • A n -i)B n . 
Note that this last definition and the definition of M, imply 

M = sup{7£ n : Tj_i < n < Ti} for all £ = 0, 1, — 
Hence by (4.8), 

(4.9) W :=supW n = supVi, 

n>l i>0 



where 



Vi 



M , i = 0; 

Bo + A M u i = l; 
Bo + i #i + • • • + (A) • • ■ ii- 2 )Si_i 

+ (A ---A i _ 1 )M i , i = 2,3,. 



Moreover, by a repetition of the same argument, 
(4.10) W fR = supV} 1) , 



i 

i>0 



where V^ 1 ^ is defined the same as Vi, but with all subscripts increased by a 

factor of one, so that Vq 1 ^ = Mi, and so on. 
Next observe that 



W = sup ^ := sup{M , B + A Mi,£ + A B X + AoA x M 2 , . . .} 



(4.11) = Bo + sup{M - B , A M h Ao{B x + AM), . . .} 
= Bo + max{M - B , A W^}, 

where 

W® := sup{M,-, Bj + AjMj+uBj + AjM j+1 + AjA j+1 M j+2 , ...} = W* 

Vj > 1. 
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This establishes the second assertion of the lemma. For the first assertion, 
note by a repetition of (4.11) that 

= sup^ (1) =Bi+ max{Mi - B^AxW^} 

= B + max{M - B, A\V m }. □ 

Set 

(4.12) ? ? = sup{a:logE[i Q ] <0}. 

Now under our basic assumptions on {^4 n }, log^4 is nonarithmetic. Hence 
by combining Lemma 4.2 with Theorem 6.2 of Goldie (1991), we obtain: 

Theorem 4.1. Suppose n > and E[A a ] < oo for some a> rj. Further 
assume 

(4.13) E[\B\ TI ]<oo and E[\M - B\ v ] < oo. 
Then 

(4.14) lim vPF{W* >u} = D, 

u — >oo 

where D £ [0, oo) is given by 

(4.15) D = —E\((B + max{M - B, AW*}) + ) V - {{AW^) + )^] 

nrh 

and fa := E[A^ log A}. 

To apply the above result, we first need to develop some properties loosely 
related to geometric r-recurrence, which describe the moments associated 
with A, B and M. 

Theorem 4.2. Assume that (Wl), (H 2 ) and (H 3 ) are satisfied. Then for 
sufficiently large a > 0, there exists an a > r such that 

(4.16) E[A a ] < oo, E[\B\ a ] < oo and E[\M\ a ] < oo 
and for a. a. id S, 

(4.17) E x [A%]<oc, E x [|B r] <oo and E x [\M \ a ] < oo. 



The proof of Theorem 4.2 is not straightforward and poses a main mathe- 
matical obstacle for our approach. This proof will be given below in Section 
5. We now proceed directly to the proofs of Theorems 2.1 and 2.2. 
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Proof of Theorem 2.1. First assume r = 77, where ij is defined as in 
(4.12). By Theorem 4.2, the conditions of Theorem 4.1 are then satisfied; 
and hence by (4.14) we obtain 

(4.18) lim u'FiW^ > u} = D asit^oo, 

u — >oo 

where D < 00 is given as in (4.15). 

Since E[A a ] < 00, some a > t, it follows by (4.18) and a result of Breiman 
(1965) that 



(4.19) lim ifFt AqW* > u\ = DE\A x n ] asu^oo. 

Also, by Theorem 4.2 and an application of Chebyshev's inequality, 

(4.20) F{B > u} < Dm' 01 and F{M - B > u} < D 2 u~ a , 
where a > r and D\, D 2 are finite constants. Moreover by Lemma 4.2, 

(4.21) W = B + max{M - B , AqW*}. 
Hence 

(4.22) lim u v P{W > u} = DE[Ak] as u -> 00, 

which establishes (2.20). 

It remains to show that r = rj. Suppose false. Let (9(a))~ 1 denote the 
convergence parameter of the kernel P a in (2.8), and let = log#. [For 
the definition, see Nummelin (1984), page 27.] Then it is well known that 
0(a) < A(a); cf. Proposition 5.1(h) below. Hence 

(4.23) r = sup{a : A(a) < 0} < sup{a : G(a) < 0}. 
Moreover, it follows from Nummelin [(1984), Proposition 4.7(h)] that 

(4.24) E Va [e aS ' Te ^]<l, 

where r = X L — Tj_i and S = S^-i — St<_i-i f° r i > 1- [See Ney and Num- 
melin (1987a), Section 4, and Proposition 5.1(iv) below for closely related 
results.] It follows as a consequence of (4.24) that 

(4.25) 8(a) < => 11(a) := E[e a§ ] < 1 

and therefore by (4.23), r < rj. Hence, if r 7^ 77, then we must have r < a < rj 
for some a. 

Now assume that this is the case, and choose a € (t, 77). Set 
Li = sup{Sj - 5 Tl _! : Tj_i < j < Ti}, i = 0,l,... 
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(where T_i := 1), and set L = Li for i > 1. By taking {B\, B2, . ■ ■} = {1, 1, . . .} 
in Theorem 4.2, we obtain that for sufficiently small a > r, 

(4.26) E[e Qi ] < 00, E[e aLo ] < 00 and Efe^o- 1 ] := E[A%] < 00. 

Then by the independence of the regeneration cycles [cf. Lemma 4.1], 



E[e 



CtS„] 



<E 



(4.27) 



i=i 



< E[e oL °] +E[e" L ]E[e aST o-i] ^E[e a5 f < 00, 

i=0 

where the last step follows since a <r] 11(a) < 1, by the strict convexity 
of IT. By (4.27) and the definition of A, we conclude that A(a) < 0, that is, 
a < t, a contradiction. Therefore x = r]. □ 



Proof of Theorem 2.2. The proof is very similar to that of Theorem 
2.1, but easier, so we only sketch the details. The main modification is in 
Lemma 4.2. In particular, we need to develop analogous random recurrence 
equations for Wt» and W^. 

Let {(Ai,Bi,Mi) :i = 0, 1, . . .} be defined as in the discussion prior to 
Lemma 4.2, and recall from the proof of Lemma 4.2 [cf. (4.8)] that 

(4.28) W n = B + A B! + • • • + (1 • • ■ Ik*-i)#k* + (io ■ • • A K *)TZ n , 

where K* := inf{i :Tj > n} — 1. 

Observe that Woo '■= li^n— >cx>Wn exists a.s. by Theorem 2.1 of 
Goldie and Mailer (2000) and Theorem 4.2 above. Moreover, since K* — > 00 
a.s. as n — > 00, the last term in (4.28) converges to zero a.s. as n — > 00. Hence 

(4.29) Wco^Bq + AqW*, 

where is defined the same as W^, except that now To = 1, so that 
regeneration occurs at the initial time. Furthermore, a repetition of the 
argument leading to (4.29) yields 

(4.30) W*±£ + AW*, 

where (A, B) = (Ai,Bi),i > 1; cf. the proof of Lemma 4.2. The last equa- 
tion is a random recurrence equation, which can be analyzed using Goldie's 
(1991) Theorem 4.1. First note by Theorem 4.2 that the moment condi- 
tions in Goldie's theorem are satisfied. The required result then follows from 
(4.29), (4.30) and Goldie (1991), Theorem 4.1. [Specifically, by employing 
(4.29), (4.30) in place of Lemma 4.2 above, and by employing Goldie's The- 
orem 4.1 in place of his Theorem 6.2, and then reasoning as in the proof of 
Theorem 2.1, the desired limit result is obtained.] 
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For the process {W*}, define {Ai} as before, and let 

B* = (At { -i ■ ■■A T ._ 1+1 )B Ti _ 1 + ■■■ + A Ti _ x B T ._ 2 + B Ti ^ Vi > 0, 

M* = sup{(A Ti _i • • • A 3+l )\Bj | + • • • + \B Ti ^\ : Tj_i < j < !•} Vi > 0. 

[Note that the definition of -B* differs from that of Bi, because a given ele- 
ment B n is now multiplied by {A n+ \ ■ ■ ■ A^-i) rather than by (At i _ 1 • ■ ■ 

Let Z* := W^ t _ x denote the value of {W^} at its (i + l)th regeneration 
time (i = 0, 1, . . .). Since 

W^Bn + AnW^, n = l,2,..., 

it follows after a short argument that 

(4.31) Z* = B* + AiZ*_ x , i = l,2,.... 

First consider Z^ := lim^tx, Z*. By Theorem 3.1 of Goldie and Mailer 
(2000) and Theorem 4.2 above, the limit exists a.s., and by (4.31), it satisfies 
the recurrence equation 

(4.32) Z^^B* + AZ^, 

where (A,B*) = (Ai,B*) for i > 1. Then by (4.32) and Goldie [(1991), The- 
orem 4.1], 

(4.33) F{Z^ > u} ~ Cu- Z as u — > oo 

for some constant C < oo. As before, the required moment conditions are 
verified using Theorem 4.2. 

It remains to show that lim^^oo W* = lim^oo Z*. To this end, let L(n) := 
inf{f :Tj > n} denote the first regeneration time after time n. Then, for 
example, the term "B n " forms a part of the sum B* L ^ n y and the discrepency 

of W* from ^Lin) 1S bounded by M L ^ . More precisely, 

(4-34) Z* L(n) - M* L{n) < W* n < Z* L(n) + M* L(n) . 

By a minor variant of Theorem 4.2, M* is asymptotically negligible com- 
pared with linin^oo ^2(n) • Consequently = Z^. □ 

5. Proof of Theorem 4.2 and some related regularity results. 

5.1. Notation and preliminary remarks. In the following discussion, it 
will always be assumed that the minorization (A4) holds for a given param- 
eter a > 0. 

First we introduce some additional notation. For any kernel K, let 

K a (x,E)= [ e afiy) K(x,dy) VaGl, 

JE 
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and let 

K a (x, E) = [ e af{y)+(3h{y) K{x, dy) Vc* = (a, (5) G K 2 . 

JE 

Let (6(a))~ 1 denote the convergence parameter of P a [as defined in Num- 
melin (1984), page 27], and let (^(q;))" 1 denote the convergence parameter 
of P a . Let 9(a) = log 5(a) and 0(a) = log 0(a), for all a = (a, (3) £ M?. 
[We will repeatedly use the notation a = (a, 0) and generally do this with- 
out stating so explicitly] 

Below, we will often need to work with a perturbed kernel, namely, 

p( £) (x,E) :=P(x, E)+eu a (E) VEeS, e > 0, 

where v a is given as in (M). In this connection, let P^ ] , P^ ] , 6^, (e) , 
G( £ ) and (e) be defined the same as P a , P a and so on, but with P replaced 
everywhere in these definitions with p( s ). 
For each x G §, set 

oo 

ri £) (^) = E(^ (£) («))" n_1 (^ £) " WW ® ^ o) ) B Mr.h(*) 

n=0 

and 

oo 

= E( 0(£) ("))" n_1 (^ ) ®»£ a) )%l^k(*), 

n=0 

where, for arbitrary g : § — > K and /j, : 5 — > R: 

g®»(x,E)=g(x)nm ^ a \E) = [ e^^dy); 

JE 

^ a )(E) = f e af ^ +/3h ^^(dy). 

JE 

In the special case that e = 0, we shall simply write r a , r a in place of r£\ 
r (o) 

The function r^ is known to be (#( £ )(a)) -1 -subinvariant with respect to 
the kernel P a , and moreover to be (0( e )(a)) — ^invariant in the case that P a 
is (#( £ )(a)) _1 -recurrent; see Nummelin (1984), Theorem 5.1 and its proof 
[and the proof of Proposition 5.1(iv) below]. Likewise, the function r« is 
known to be (0^ e - > (o;)) _1 -subinvariant with respect to the kernel P a , and to 
be (£?( e )(a)) _1 -invariant in the case that P a is (0^(a:)) _1 -recurrent. 

Let 

(a) = limsup - log(P^) n (x, §) VaGR 2 , 

n— >oo Tl 
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where x is the initial state of the Markov chain. [If we replace p( £ ' with P in 
this definition, then the right-hand side reduces to the Gartner-Ellis limit, 
A, which was introduced in Section 2.] 

Finally, in the proofs below we will make use of the "shifted" kernel 



Je 6&(a)r ( a>(x) 



Under the minorization (A4), observe that the kernel Qa itself satisfies a 
minorization, namely, 

(Mq) gg)(x)tg\E)<Q$(x,E) VxeS, E G S, 

where 

\6&(a)r [ a>(x) J 2 

and B is a normalizing constant chosen so that [iht ^ is a probability measure. 
After a truncation of v a as described above in Remark 2.1, the integral in 
the definition of will always be finite and hence < B < oo. 



5.2. Some regularity properties. In the next two propositions, we col- 
lect various regularity properties which will be needed in Section 5.3. After 
reading the statement of the propositions, the reader may want to proceed 
directly to Section 5.3 — which contains the core results of this part of the 
paper — and refer back to the present section as necessary. 

In the following discussion, set inf{0} = oo and let IDa, ID a an d ID© 
denote the domains of A, A and 0, respectively. [To be entirely precise, 
a € IDa means that A(a) < oo for y>-a.a. initial states x, cf. part (i) of the 
next proposition; and our main results will be valid away from an appropriate 
set of measure zero.] 

We remark that in (i)-(iii) of the following proposition, we develop prop- 
erties of A^(a), ®( e \a), etc., but the same properties hold also for A^ £ \a), 
G( £ )(a),etc, as can be seen by setting a = (a, 0) and observing that A^ (a) = 
A( £ )(a), (£) (a) = 6( £ )(a), and so on. 



Proposition 5.1. Assume (M). Then: 
(i) For any a € IDa, 



(5.1) 



A(a) = A Ua (a), ip-a.a. x, 
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where A Ua denotes the same limiting quantity as A, but conditioned on re- 
generation at time zero. Thus, A is independent of its initial state. Moreover 
for allN>\, 

(5.2) A N (a) := sup - logE^ [e" 5 "^ 5 ^] < oo Vet G D A , 

n>N n 

where E^[-] denotes that regeneration occurs at time one. 

(ii) The function 0^ is convex, and ®( £ \a) < A^ £ \a) for all a G Da 
and e > 0. 

(iii) For any a G Da, 

(5.3) limA^(a) = A(a). 

(iv) If a G Da and e > 0, i/ien Qa is either a subprobability or probability 
measure. Moreover, if is {6 [a))" 1 -recurrent, then Qa^ is actually a 
probability measure. 

PROOF, (i) Let A > 0, and set 

F A = {x€S:A x (a)> A I/o (a) + A}, 

where A x denotes — now explicitly — that we are conditioning on the Markov 
chain starting in state x. Also, set 

H m {E) = E„ a [e<«. s "»> ; X m G E] Vm G Z+, E G S, 

where S n = (S n ,Sn). 
Note that 

(5.4) K a [e {a ' Sm+n) ]>K a \e {a ' Smh ,X m eFA] inf EJe^^l. 

x&F A 

Hence it follows from the definitions of F A and {[i m } that 

(5.5) A„,(a) > liminf-log/i m (F A ) + (A I/o (a) + A). 

Since fJ, m (F&) obviously does not depend on n, this last equation is only 
possible if /i m (-F A ) = for all m. Consequently, F{X m G F A } = for all 
m. Since ip(F&) > would imply that {X m } would visit Fa with positive 
probability over a regeneration cycle [Athreya and Ney (1978), Section 6], 
we conclude that <p(F&) = 0. Since the last equality holds for any A > 0, it 
now follows from the definition of Fa that A x (a) < A Ua (a) for </>a.a. x. 

Conversely, if £ m denotes the event that regeneration occurs at time m, 
then 

(5.6) E x [e^ Sm +^) >E a ,[e< a ' S ™>;£ m ]E,Je^ s ">]. 
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If to is chosen such that P{£ m } > 0, then E x [e( a > s ™) ; £ m ] > 0. Hence it follows 
upon taking limsup n ^ 00 n~ l log(-) in (5.6) that A x {ot) > A Ua (a). 

It remains to study (5.2). Taking limsup m ^ 00 m _1 log(-) on the left- and 
right-hand sides of (5.6), where £ m now denotes the event that X m £ C a h 
[the "small set" in (M.)], yields 

(5.7) A(a)>®(a) + lim — logEW[e ( "' Sn> ], 

where the first term on the right-hand side was obtained from the defini- 
tion of the convergence parameter [cf. Nummelin (1984), page 27. On the 
right-hand side, we have also used the fact that regeneration occurs with 
probability 5 a upon each return to C a h, independent of the prior evolution 
of the chain] . 

Now the term on the left-hand side of (5.7) is finite, by assumption; while 
the first term on the right-hand side is greater than — oo, by Nummelin 

(1984) , Theorem 3.2. Since the second term on the right (but inside the 
limit) does not depend on m, we conclude that it must also be finite, that 
is, 

(5.8) EW [ e <"' s «)] < oo Vn € Z+, a £ S) A . 

Furthermore, for any a € Da, it follows by Remark 2.1 (and the fact that 
we have just shown A = A Ua ) that 

(5.9) limsup-logEW^"' 8 "^ = A(a) < oo. 

n— >oo Tl 

[After truncation of v a , as described in Remark 2.1, we have suppf a C C a hn 
Cbf, where max{a, b} < oo, and hence e^ a ' Sl ^ is deterministically bounded, 
meaning that the superscript "(1)" may be removed when taking the limit 
on the left-hand side of (5.9).] Hence, the terms inside the limit in (5.9) are 
all finite and, in the limit, they converge to a finite constant. Consequently, 
(5.2) follows from (5.8) and (5.9). 

(ii) The convexity of 0^ is obtained as in Iscoe, Ney and Nummelin 

(1985) , Lemma 3.4. For further details, see Collamore (2002), Lemma 4.2. 
To see that 0^ < A^ £ \ note that the minorization (M) implies a corre- 
sponding minorization for P a , namely, 

(M) 5 a l Cah (xA a) (E)<P a (x,E) Vxe§,E£S. 

Then by the definition of the convergence parameter [Nummelin (1984), 
page 27] and Proposition 3.4 of Nummelin (1984), we see that (©^(a))" 1 
is the radius of convergence of the power series 

oo 
n=0 
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(since C a h is a "small set"). But (A^(a)) -1 is the radius of convergence of 

oo 
n=0 

Hence C a h C § 0^(a) < A^ £ \a). 

(iii) First observe [following the proof in Collamore (2002), Theorem 3.1] 
that 



(5.10) 

= E J n ^ a a) (dxi)J {n) (x 1 ,dx 2 )---J^(x n _ 1 ,dx n ), 
(u,...,in-i)ei s 

where J*- ) = ez> a , J*- 1 ) = P a and I consists of all elements of the form (ii, . . . , 
i n -i) such that ij € {0, 1}, j = 1, . . . , n — 1. What needs to be shown is that 

(5.11) A$(a) :=limsup-log£^ ) (o;)^A I/a (Q:) as e -> 0. 

n— >oo Tl 

[The required result then follows from (5.11), since by (i) we have A(at) = 
A Va (a), and a repetition of the same argument also gives A^ £ \a) = A)fJ(cx).] 
Let 

logb N = N{A 1 (a)-A N (a)} ViV>l, 

where A^y is defined as in (5.2). Setting crN(a) =expAAr(a), then by defi- 
nition 



(5.12) b,v 

\crjv(a). 

Note that if a 6 T>\, then it follows by (5.2) that b^v < oo for all N. Fix 
N G Z + and consider the products on the right-hand side of (5.10). Note 
that 



>(«)< 



^(dxi)(J (il) (xi,dx 2 ) • • • J (in) (a; n _i,dx n )), 



a 

consists of a product of blocks which each have the form 
e0^ a \dxj)(P a (xj,dxj + i) ■ ■ ■ P a (xj+i-\, dxj+i)) where j > 1 and / > 0. 

(When 1 = 0, it is understood that the product involving the P a terms is 
empty. Also, the leading "e" term only appears for j > 1.) Now the total 
number of blocks is k, where k is the cardinality of {j :ij = 0, 1 < j < n} + 1. 
Moreover for each block, 



(5.13) 



^ a) (e^)(P a (xj,dx j+1 ) • • •P Q (x j+ /„i,dx j+ /)) 
= E (i) [e («,s ;+1 > ] < ((7m(Q)) m i 
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where the last step follows from (5.2) and the definition of 07 given just prior 
to (5.12). Now by definition, cri(a) is decreasing in I for any fixed ex. Hence 
if 1 + 1 > N then 07+1(0:) < crjv(a), while if I + 1 < N then 07+1(0:) < 01(a), 
and hence 

(a l+1 (cx)) l+l <b N (a N (<x)) l+l 
by (5.12). Substituting these estimates into (5.13) yields 

(5.14) / 0( a a \dx j ){P a {x j ,dx j+1 )---P a (x j+l _ 1 ,dx j+l )) < b N {a N (a)) l+1 . 

JS i+1 

If the total number of such blocks attached to a given (ii, . . . ,i n —x) G I is 
denoted by k, then we obtain for each individual integral on the right-hand 
side of (5.10) that 

/ u^(dx 1 )J^(x 1 ,dx 2 )---J^(x n . 1 ,dx n ) 

(5.15) 

<e fc - 1 (b^(«)) fc (cx J v(a))"- fc . 
Summing over all . . . ,i n -i) £ Ij it follows from (5.10) and (5.15) that 

(5.16) 4 £ )(«) < ^(a N {a) + eb N a N (a)) n . 
Hence 

(5.17) limsup-log^ e) (a) < log(a N (a)(l + eb N )). 

n— >oo Tl 

Now let e — > and then N — > 00. Note that 

logOTv(o) = Ajv(o) — > A Ua (a) asiV^oo. 

Consequently (5.11) follows from (5.17). 

(iv) This is a variant of Nummelin (1984), Theorem 5.1. First set 7 = 
(O&ia))- 1 , and set 

00 

IC = l(Pi £) -5 a l Cah ®vf>) and G = £/C n . 

n=0 

Then Proposition 2.1 of Nummelin (1984) gives that G = I + KG, and thus 

(5.18) 7G5 a l Cah = jS a l Cah + l 2 (Pi £) - 5 a l Cah ® v^)G8 a l Cah . 
The first term on the left-hand side is 



(5.19) 7 G5 a l Cah := £ l n+ \Pi £) - 5 a l Cah ® v { a a) T5 a l Cah :-- 

n=0 



r (e) 
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This equivalence also identifies the middle term on the right-hand side of 
(5.18); namely, 



(5-20) 1 2 P^G6 a l Cah = 1 P^i 



a ' a 1 



Moreover, the last term on the right-hand side of (5.18) is 
j 2 (6 a lc ah ®0^)G5 a l Cah 

(5.21) = j6 a l c Jjt ^rt a) (Pi £) ~ S a lc a h ® ^T~XU a i\ 

f = 75 a l Ca h, if Pa is 7-recurrent, 
\ < ^5 a lc a h, if Pa is 7-transient, 

where the final step was obtained from Nummelin (1984), Proposition 4.7. 
[In Nummelin's notation, the quantity in brackets is identified as 6(7), which 
is equal to one in the 7-recurrent case and is less than one in the 7-transient 
case.] Substituting (5.19), (5.20) and (5.21) into (5.18) yields 

(5.22) r£> = iP®r£\ if P a e) is 7-recurrent, 
and 

(5.23) rf> > jP a e) r^\ if P^ is 7-transient. 

Hence, is 7-invariant when P a 6 ^ is 7-recurrent and 7-subinvariant when 
P a is 7-transient. Moreover, after rearranging terms in (5.22) and (5.23), 
we obtain by definition that 

q(?)( x g) •= l{Pg' >r a ) ){ x ) / = if Pa^ is 7-recurrent, 

Ta\x) \ < 1, if Pa^ is 7-transient. 

Hence, Qa is a probability kernel in the 7-recurrent case and a subproba- 
bility measure in the 7-transient case. □ 

Remark 5.1. In Proposition 5.1(h), it has just been observed that 

9(a)<A(a) = Va, 

but the reverse inequality need not be true, in general. However, under (971), 
(H2) and (H3), it can be shown that 

(5.24) 9(r)=0 = A(r). 

This last equation can be obtained as a consequence of Theorem 4.2 and the 
proof of Theorem 2.1. 
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To establish (5.24) based on the aforementioned results, first note that A 
is convex [by Holder's inequality] and finite in a neighborhood of r [by (H2), 
since A((a,/3)) is nondecreasing in (3\. Consequently A is continuous at r. 
The definition of r then implies that A(r) = 0; thus, it is sufficient to show 
that 9(r) =0. 

To this end, note that in the proof of Theorem 2.1, we have shown that 

r = 7 ? :=sup{a:logE I/ Je clS ] < 0} where S = Sr t -i - Sr^-i for i>l. 
Let 

r = sup{a : 0(a) < 0}. 

Now if r > r, then for any a 6 (t,t) we would have 0(a) < [since G is 
convex and a < t] , and K Va [e aS ] > 1 [since a > x] . Hence 

(5.25) E Ua [e aS ~ T@ ^]>E Ua [e aS ]>l 

and this is a contradiction to (4.24). We conclude that r < r, which means 
that 0(a) > for all a > r. Now 6 < A, and so (H2) =>• r £ intSe- Since a 
convex function is continuous on the interior of its domain, it follows that 
is continuous at r. Moreover, as we have observed, 0(r) < 0, and 0(a) > 
for all a > r. By continuity, we conclude that 0(r) =0, thus establishing 
(5.24). 

Proposition 5.2. Suppose that (M) and (H 3 ) are satisfied, and sup- 
pose that there exist (3 > and e > such that (r, (3) € intS A ( e ) . Then: 

(i) The functions ra and r« are uniformly bounded from above on 
C a h. Moreover, there exists a sequence {(aj,ej)}j<=z + , where (ctj,Ej) \ (t,0) 
as j — > 00, and the sequence {ctj} ■= {(ay,/?)}, such that if 

Cj := sup (x) and Vj := sup (x) Vj, 

then {Cj}j£z + and {Vj}j^z + are bounded from above by finite constants. 

(ii) The functions Va and are uniformly positive on §. Furthermore, 
if {(ctj,£j)} and {ctj} are given as in (i) and 

Cj := inf TaP{x) and Vj : = inf rfe (a;) Vj, 

x£C a h x£C a h 

then, possibly after passing to a subsequence, we have that {Cj}j<=z + and 
{T>j}j e z + are bounded from below by positive constants. 
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Proof, (i) Without loss of generality, we may assume that the constant 
D a in (H3) is greater than or equal to one. 

For any x 6 C a h and E € 5, it follows from (H3) that 

(5.26) PV(X,E)=[ e a ^P(x,dy)+e^(E)<D a j2Pi £ \x u E), 

JE i=1 

where xi e Ei for all i. Consequently, for any x £ C a h, 

K(x,E) := (P^-5 a l Cah ®i>i a) )(x,E) 

(5.27) i 

< Daj^n^E) + (DJ - l)5 a i>t\E). 
i=i 

Hence for all x € C a h and n> 1, 

/C n (x, £) < jf f £> a £ , dy) + (D a Z - 1)<5«# {dy)j IC 1 " 1 (y, E) 

(5.28) l ~ l 

= D a J2^ n (^E) + (DJ - l)5 a {v^K n - l )(E). 



i=l 

.(e) 



It follows from (5.28) and the definition of r« that, for any x £ £ a /i, 

00 

r^z) := ^(^(a))"^ 1 ^^!^)^) 



rc=0 

(5.29) 



DJ - 1 



9^ (a) ' 

where, in the last step, the second quantity on the right-hand side was 
obtained from Nummelin (1984), Proposition 4.7(h). Thus is bounded 
from above on C a h, provided that the quantity on the right-hand side is 
finite. But ra is necessarily finite 92-a.s. [Nummelin (1984), Proposition 
5.1]. Since the E^-sets in (H3) are 92-positive, we may then choose x±, . . . ,xi 
such that r&\xi) is finite for each i. 

To establish uniform boundedness along a sequence {(cxj,£j)}, where (ay, 
Ej) \ (r, 0) as j — > 00, set 

(5.30) qi(x)= hminf r^(x) Vx € 8, 

(a,e)\(t,0) 

where the limit is taken along the straight-line path joining (t,0) to some 
point (a, e) with a > r and i > 0. 
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We begin by identifying the limit of ®( £ \a) as (a,e) \ (r, 0). It follows 
immediately from the definition of that ®( e \a) > 0(a) for all (a,e); 
and hence 

(5.31) liminf 9 (e) (a) > lim 9(a) = G(r) > -oo Va, Ve > 0, 

(a,e)\(r,0) a\x 

where the last two inequalities follow from Theorem 3.2 of Nummelin (1984) 
and the convexity of G. Moreover, it follows by Proposition 5.1 that 

(5.32) 9 (e) (a) < A® (a) \ A(r) as (a,s) \ (r,0); 

and from the definition of r, we also have that A(r) < 0. [In the last step 
of (5.32), we have used the fact that A is convex, which follows from an 
application of Holder's inequality, and we have used our assumption that 
(r, 0) G int2)A, which implies that r € intS)A-] From (5.31) and (5.32) we 
conclude that 

(5.33) liminf 9&(a) = a G (0(t), 1]. 

(a,e)\(t,0) 

Recalling the definition of q\ in (5.30), now apply Fatou's lemma to the 
equation 

/ e af ^ri £ \y)P ( - £ \x,dy) < 9&(a)r®(x) 
Js 

to obtain 

(5.34) / e^q 1 (y)P(x,dy)<q 1 (x). 



Thus qi is a sub invariant function for the kernel P x . Also, by Nummelin (1984), 
Proposition 5.1, the set {x:q\{x) < oo} is full; hence, either q\ < oo 99-a.s. 
or qi = 00. 

To see that the latter is not the case, observe by the definition of and 
Proposition 4.7(h) of Nummelin (1984) that 

r^(x)ui a \dx) < 1. 

Applying Fatou's lemma to this equation yields 

(5.35) / qi (y)^\dx)<l. 



Consequently q\^oo. 

Hence, if E\ is given as in (H3), then 

(5.36) liminf r^\x\) := q\(x\) < 00, 99-a.a. X\ € E\. 

(a,e)\(r,0) 
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Therefore, for any given x\ G E\, there exists a subsequence {(a'j,e'j)}j £ z + 
such that 

(5.37) supr , 3 (xi) < oo. 

j J 

Next, repeat the same argument but with 

( e '.) 

(72(3;) :=liminfr / (x) 

j->00 a j 

in place of q±, to establish the existence of an element X2 € E2 and a subse- 
quence {(a'j,Ej)} C {(q^-,^)} such that 

(e") 

(5.38) supr Q ,? (x 2 ) < 00. 

j j 

Continuing in this manner, we obtain elements X{ G Ei, % = 1, . . . , I, and a 
sequence {(aj,£j)} where (ay,£j) \ (r, 0) as j — > 00, such that 

(5.39) supraj\xi) < 00, i = l,...,l. 

j 

Substituting this expression and (5.33) into (5.29) yields that for some 
positive integer Jo, 

sup ra J \x) < DJ (sup ra-\xi) + - ) \/x G C a h. 
j>j aj 

Since the quantity on the right-hand side is independent of x G C a h and 
j > Jo, we conclude that {Cj}j>j is bounded. 

The proof of these properties for the function is essentially the same. 
The main modification arises in the definition of q%, which is now replaced 
with 

qi(x) := liminf r&'(a;), 

where {(ctj,£j)} is given as above. The previous argument may then be 
repeated to obtain successive subsequences corresponding to functions qi, 
q 2 , . . . . (The choice of the elements x% G Ei may, of course, be different in 
this case.) As before, we obtain boundedness for the sequence {T> j}, but now 
along a subseqence { (atj k , Ej k ) } of {(aj,£j)}. Thus (i) holds. 

(ii) To show that is uniformly positive, note by definition that 

P ie) (x,E)>e [ e af ^v a {dy):=ei> { a a) (E). 
Je 
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Since r^ is (#( e )(a)) _1 -subinvariant with respect to the kernel Pa \ it fol- 
lows that 

(5.40) k 

> (^(a))" 1 ^ / r^(y)i>i a \dy) = const. > 0, 



where the last inequality was obtained from the positivity of Ja . Hence 
is uniformly positive. 

To establish the uniform positivity of {Cj}, note by the definition of rffi 
and the definition of K, [in (5.27)] that 

(5.41) r(f)(x) := ^(6^ (a))^ 1 (lC n 6 a l Cah )(x) > ■ 

Since liminfj^oo^^^Oj) = o G (0(t), 1] [by (5.33)], it follows that for some 
positive integer J\ > Jq, 

(5.42) inf rSr (x) > — > Vx G C a h. 
j>Ji 3 2a 

Hence — possibly after redefining the sequence {(ctj,ej)} so that (aj 15 e jj) is 
actually the initial term of this sequence — we obtain that {Cj} is bounded 
from below by a positive constant. 

The same reasoning shows that the sequence {T>j} is likewise bounded 
from below by a positive constant. □ 

5.3. Proof of Theorem The proof will be based on two lemmas. To 

motivate the first of these lemmas, recall that a Markov chain is geometric 
recurrent if 

(G) sup lE^fp 1 "] < oo for some p > 1, 

where r is a typical regeneration time. To establish (Q), one usually begins 
by showing that (Q) holds under the additional assumption that there exists 
an atom, namely, 

(Q 1 ) sup E^Lr 5 "] < oo for some p > 1, 

x£C a h 

where T denotes the first return time of the chain to its regeneration set, 
that is, X := inf{n : X n G C a h}. 

Let Q a ,r a be defined as Qa\ta but with e = 0; cf. Section 5.1. Then 
the goal of our first lemma will be to establish (£/') for the shifted kernel 
Q a ; that is, 



(5.43) sup E®[p % ] := sup E x 

x£C a h x£C a h 



r a {X ) 



< oo, 
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where E^[-] denotes the expectation under the measure Q a (and for this 
heuristical discussion, we assume that Q a is indeed a probability measure). 
Since 6(a) \ 0(t) w as a — > r, it is a natural strengthening of (G 1 ) to 
require that 

r p 1 e aS %r ^ x% y 



(5.44) sup E x 



r a (X ) 



< oo for some a > r. 



While (5.44) has a straightforward probabilistic interpretation, it is not 
sufficiently strong for our purposes because we will ultimately need the eigen- 
functions r a to be uniformly positive, which may not be true in general. 
Hence we will need to work with the perturbed kernel P^ rather than P. 
But is not a probability kernel, and consequently, we must first replace 

(5.44) with a more general series representation. To this end let 

(5.45) <p (e) (x,£)= / P( £ \x,dy) 

JEn{C a h) c 

and 

(5.46) P {e) {x,E)= f P®(x,dy) Ve > 0, 

JEnCah 

and set = «p<°) and P = P^ . Qualitatively, *p is a kernel which is nonzero 
and equal to P when {A^} avoids the regeneration set C a h, and P is a kernel 
which is nonzero when {X n } enters C a h. Thus, in particular, ?($ n ~ 1 P(x,§) 
describes the event that {X n } begins at state x, avoids the set C a h during 
its first n — 1 time steps, and enters the set C a h during the nth time step. 
Thus it follows from the definitions that 

' p J e aS ^r a {X % 



(5.47) E 



-1 tAJ 

Taix) ^ 



r a (X ) 

Our first objective is to establish the following variant of (5.44), (5.47). 

Lemma 5.1. Assume that (9JI), (H2) and (H3) are satisfied. Then for 
sufficiently large a > and any constant p > 0, there exist a constant D < 00 
such that 

(5-48) sup |^^f;p«((^))"- 1 pi £) r (f))(x)) <D, 



X 



uniformly for (a,e) G {(ctj ,£j)}j>j , where (ctj,£j) \ (r, 0) is given as in 
Proposition 5.2 and J is a positive integer. 

Proof. By (H2), there exist points a > r and (3 > such that A(d, j3) < 
00. Set a = (a, (3), and observe by Proposition 5.1 that 

{e) (a) < A (e) (a) \ A(d) as e -»• 0. 



40 



J. F. COLLAMORE 



Hence 

< oo for e > sufficiently small. 

By a similar argument, 

(e) (0) < 1 for e > sufficiently small. 

Then it follows from the convexity of 0^ that on the line segment joining 
the origin to (a, /3), the function 0^ always lies below max{&^ £ \0), 0^(a, 
In other words, for some e > we have 

(5.49) &&(a,^o\ <max{l,0^(d,/3)}<oo Va€[0,a]. 

Now by definition, ©^(a, j3) is nondecreasing in (3 [for fixed (a,e) and 
increasing f3], and &^ £ \a,(3) is nondecreasing in e [for fixed (ce,/3) and in- 
creasing e]. Consequently it follows from (5.49) that 

(e) (a,/3) <max{l,0^(d,/3)} < oo 

(5.50) 

Va G [r, a] , e G [0, i], and for /3 = ^x. 

a 

In other words, ®^ £ \a,f3) is bounded uniformly when (a,e) G [r, d] x [0, £]. 
Therefore, for any p > 0, and for a = (a, /?) and ft is given as in (5.50), there 
exists a finite constant a such that 

(5.51) e {£ \a) :=exp0^(a) < V(a,e) G [t,d] x [0,el. 

2p 

For the remainder of the proof, the constants (5 and a will now be fixed, while 
the parameters a and e will be allowed to vary within the range of values 
specified in (5.51). [Later, we will restrict (a,e) to those elements belonging 
to the sequence {(aj,Ej)} which appeared in the statement of Proposition 
5.2.] 

By the subinvariance of r«\ 

(5.52) / e af ^ +f3h ^P^{x,dy)r^(y) < (a)rg> (x) . 



Hence 

(5.53) e? a f e afiy) P i£ \x,dy)r {£) (y)<0( £) (cx)r {£) (x). 

J(c a hy 

By setting 

IC(x,E)= f pP^(x,dy) and V = r {£} 

JEC\(C n hY 
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and substituting (5.51) into (5.53), we then obtain 

(5.54) KV < -V. 

Now if (a,e) 6 {(aj,Ej)}, then it follows by Proposition 5.2 that is 
uniformly bounded from below on C a h. Hence for some positive constant c, 

(a)rL e) (x) > I Pif ) (x, dy)r£ (y)>cf (x, dy) 

(5.55) 

> c / (x,dy), 

where, in the last inequality, we have used the fact that h>0 P a > P a 

for all j3 > and a = (a, (3), and we have used the fact that P^\x, •) = 
on (C a h) c [cf. (5.46)]. If we now define the constant A > and function 
A : § -> R by 

(5.56) A = — ^ and A(x) = A, Vx 6 S, 

V ; 20^(a) W 

then from (5.55) we obtain 



From (5.54) and (5.57), we conclude 



(5-57) P^A<-r£:=-V. 



(5.58) KV <V -P^A, 

which may roughly be viewed as a drift condition satisfied by the kernel K. 
Next, we claim that 

m 

(5.59) 1/-/C m y > ^/C^P^A VmeZ + . 

n=l 

To establish this claim, proceed by induction, observing that (5.59) holds 
when m = 1, by (5.58). Now assume that (5.59) holds for arbitrary m. Multi- 
ply (5.59) on the left by K and apply (5.58) to the first term on the left-hand 
side to obtain 

/ s rn+1 

(5.60) (V - En' A) - K m+l V > K n ~ Y P_u A, 

71 = 2 

which is (5.59) with m + 1 in place of m. 
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Since fC m V > 0, it follows upon letting m — > oo in (5.59) that 
5 - 61 



n=l 



Now by the definition of P_( e \ we have that P^\x, •) = on (C a h) c , and by 
Proposition 5.2, is bounded from above on C a h for all (a,e) E {(ay,£j)}. 
Hence for any (a,e) G {(ay,£y)}, there exists a finite constant d such that 

/ P^\x,dy)r^(y)<d f P®(x, dy) = A(x) V-E € 5; 

that is, Pq^A > (A/d)P_^ rffi- Substituting this inequality into (5.61) yields 

(5-62) £ p"((^ ) ) n - 1 Pi £) r£ £ ))(x) < — F(x). 

r« (x) n= i Ar^(x) 

Thus we have established (5.48), except that the constant D on the right- 
hand side of that equation must be replaced with the quantity on the right- 
hand side of (5.62), which [from the definition of A given in (5.56)] may be 
identified as 

cr a (x) 

It remains to show that Da' is uniformly bounded from above when x £ 
C a h and (a, e) 6 {(ay, But by (5.51), G^ s \a) is uniformly bounded from 

above for a S [r, a] and e G [0, £]. Moreover, by Proposition 5.2, and V := 
r„ are uniformly bounded from above and below on the set C a h, provided 
that (a,e) 6 {(ay, ey)}- Thus we conclude that is uniformly bounded 
from above for x G £ a /i and a S {(ay,£j)} n ([t, a] x [0,e]) = {(ay, £j)}j>j, 
for some positive integer J. □ 

Lemma 5.2. Assume that (9JI), (H2) and (H3) are satisfied, and let 
S' n = S n + f(Xo). Then for a > sufficiently large and p > 1 sufficiently 
small. 



(5.63) E 



T-l 

^ p n e a S'n 

Ln=0 



< 00 /or some a > r. 
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PROOF. For any x eS and E € 5, set 

Q. { l\x,E)= f Q {£ \x,dy). 

JEn(C a h) c 

Then it follows directly from the definitions in Section 5.1 that 

Je e( s )(a)r { a'(x) 

and 

Now by Proposition 5.1, Q$ is a probability measure whenPi e) is (^(a))" 1 - 
recurrent, and Qa is a subprobability measure when P a e) is (^(a))" 1 - 
transient. First assume that Pa is (0^ e -'(a)) _1 -recurrent. 

Let p > a -1 , where a is given as in (5.33), and apply Lemma 5.1. This 
states that, uniformly in x € C a h and (a,e) 6 {(otj, £j)}j>j, there exists a 
finite constant D such that 

oo „ 

(5.64) = £ / ^(^ E )(«)) n (0^)"- 1 Q^(x ) d 2 /)l £aft (y) 
= Eg[(^W(a)) 5r ] 1 

where E^[-] denotes the expectation under the shifted measure Q { a ] . Now 
the probability measure Qa satisfies the minorization {M.q) introduced in 
Section 5.1. It follows that upon each return to the set C a h, regeneration 
occurs with probability g& (x). Let E!j?[-] denote expectation conditioned on 
the event that regeneration does not occur at the start of a cycle evolving 
from C a h. Then as a consequence of (5.64), we obtain 

(5.65) (l-gV(x))fcQ[(p0V(a)f]<D 

uniformly in x G C a h and (a,e) G {( a j,£j)}j>J- 

We begin by obtaining a lower bound for the function ga ■ By the defini- 
tion of ga^ [given in Section 5.1 under (Mq)], 
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where 

7i £) := g^r( sup/^W)" 1 j^r^(y)u a (dy) 

„( £ ) 



and we now assert that 7a > 7o f° r some positive constant 70. To establish 
this assertion, first recall by (5.33) that limmfj^oo^^Oj) = a G (6>(t), 1], 
and by passing to an appropriate subsequence, we obtain the convergence 
as a limit rather than lower limit; that is, 

(5.66) lim 0fo)(a*) = a G (0(t), 1]. 

Next recall by Proposition 5.2 that, for all (a,e) G {(<x/, £.,■)}, is bounded 
from above on C a h. Therefore, there exists a finite constant b such that 

(5.67) liminf 7 J j) > ^liminf / e a i f ( y) rt j ' \y)u a (dy). 

J-+00 6 j^oo 

Moreover, from the series representation of ra [given in its definition in 
Section 5.1], we also obtain that 

(5.68) liminfrMx) > 5 A^M. V x G S. 

Then applying Fatou's lemma to the last term on the right-hand side of 
(5.67) yields 

(5.69) liminf / e^Vfe 3 \y)u a (dy) > — [ e xfiy) v a (dy) > 0, 

j^<x> j§ a J c a h 

where the last inequality follows since suppz^ a C C a h; cf. Remark 2.1. Thus 
we conclude that — possibly after passing to a subsequence — {jap} is bounded 
from below by some positive constant, which we call 70, and we may assume 
that this constant has been chosen sufficiently small such that 7 o G (0, 1/2]. 
Now choose t > 1 sufficiently large such that 

(5.70) D < 



2(1 --yd)*" 

Then choose (a,e) G {(ctj,£j)}j£j and p > 1 sufficiently small such that 

0* /0( e )(W)\ t_1 
(5.71) t(_U) <* 

[This choice is possible due to (5.66) and the fact that we have chosen 
p > a -1 . The point (a,e) and constant p satisfying (5.71) will now be fixed 
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for the remainder of the proof.] Since ga\x) < 1/2, substituting (5.71) into 
(5.65) gives 



(5.72) 2D > E^ 



> mi 



Vx G £ a h, 



where the last step was obtained by Holder's inequality. Then (5.70) and 
(5.72) yield 



(5.73) 



D := sup Ej 

x£C a h 



< 



l-7o 



Our next objective is to show that if 

c ,_ pfl (£) (a) 
a 

then (5.73) implies that E^[C T ] < oo, all x G C a h. To this end, let N denote 
the random number of returns to the set C a h which occur before regeneration 
actually takes place; that is to say, regeneration occurs directly following the 
Nt\i visit to C a h. Then 

(5.74) sup E$[C\N] < ( sup E^[C % ]) N = D$ . 

x£C a h XCCah 

Now the probability that regeneration occurs upon any given return to C a h 
is given by g a (x) > 70. Thus (5.73) and (5.74) yield 

00 

(5.75) sup E£[C T ] < " 7o)"- 1 ^ n < °°- 

X&Cah n= l 

Moreover, since 0^ £ \a) \ a implies that £ > 1, we have 

pr-l 



.n=l 



<E^ 



1 1 



const. -E^C 7 ] 



(5.76) 

Therefore, since o < 1, 
(5.77) sup E<2 



Vx G C a h. 



T-l 



E(^ (e) («)) r 



Ln=l 



< sup E\ 



x&Cah Ln=l 



T-l 



< 00. 



X&Cah 

Finally observe that 

E^[( P e^(a)) n l {T>n} ] = (p0^Xa)) n (Q$-g&®^) n (x,S). 

More explicitly, under the minorization (Mq), 1 — ga describes the proba- 
bility that regeneration does not occur at any given time step, and (Qa^ — 
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— go 1 ) describes the transition kernel in the event that regen- 



eration does not occur. It then follows from the series representation of a re- 
generation cycle [as utilized, e.g., in Nummelin (1978) or Ney and Nummelin 
(1987a), Lemma 4.1] that 



E, 



T-l 



E(^ (£) («)) n 



.71 = 1 



n.=i 



(5.78) 



> 



1 



a 

E, 



r ( a '(x) n =l 
1 » 



:1 Ta\Xo 



[The inequality comes from the fact that, in the definition of ga\ we have 
truncated this quantity at the level 1/2. Also, inequality results since we 
have substituted P a for Pa ■] Hence 



(5.79) 



sup E x 

x£C a h 



=1 ra\X(j 



< OO. 



The eigenvectors can effectively be removed from this last expression, since 
(a,e) may now be fixed, and then r& is uniformly bounded from below 
by a positive constant, while r& is uniformly bounded from above on C a h. 
Moreover, suppf a C £ b f (cf. Remark 2.1), so Xq ~ v a ==» f(Xo) <b < oo. 
Hence S' n < S n + b, all n > 0. Therefore it follows from (5.79) that 

-T-l 



(5.80) 



E, 



J2p 

Ln=0 



n„aS' 



< OO 



as required. 

If Pi e) is (0 (e) (a))~ 1 -transient, then we can normalize the measure Qa ^ , 

multiplying it by an appropriate constant A > 1 , so that XQa ^ is a probability 
measure. Let 

R^(x,E) = XQ^(x,E) and m^(x, E) = XQ^(x, E) Vs.E. 
Since A > 1 , the measure satisfies the same minorization as Qa > namely, 



(Mr) g { a £) (x)fi^ (E) < 4 £ ) (x,E) Vx e S, E e S; 

cf. (Mr) in Section 5.1. Hence the regularity properties that have just been 

developed for g& may also be applied to the present case without any 
modification. 
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To obtain (5.63), choose p > A/a and apply Lemma 5.1, just as in (5.64), 
(5.65), to obtain 



(5.81) 



R 



0( £ )( 



A 



>(l-0W(z))E 



a 



A 



uniformly in x € C a h, where E^[-] denotes expectation under R conditioned 
on the event that regeneration does not occur at the start of the regeneration 
cycle. 

Choose t > 1 sufficiently large so that (5.70) holds, and then choose (a,e) 
and p > A sufficiently small such that 

J- /o(e)t„,\ \ t-i 



(5.82) 



P_ 
a 



(a 



Ao 



[As (a,e) \ (r, 0) and p \ A, the left-hand side of (5.82) converges to A/a < 
p, where the last equality follows from the original choice of p. Thus, it is 
always possible to find appropriate elements (a,e) and p satisfying (5.82).] 
Using (5.82) in place of (5.71), we then obtain (5.72), but with Ej[-] replaced 
everywhere with E^? [■] and u 9&(a)" replaced everywhere with "#( £ )(a)/A." 
The remainder of the proof may now be repeated without change [except 
that "#( £ )(a)" is replaced everywhere with "#( £ )(o:)/A"] to obtain (5.77) or, 
more precisely, 



(5.83) 



sup E« 

x£C a h 



T-l 

E 

n=l 



To complete the proof, note that 



E 



R 



T-l 

E 

.n=l 



p9( £ \a) 



p9&(a) 



p0( £ \a) 



< oo. 



(i^-si £) ®/4 £) ) n M) 



(5.84) 



= E 

n=l 

oo 

> E(^ (£) H) n (Qi £) -9i £) ®/4 e) ) n (*,S) 



71=1 



> E T 



E 



1 ra\X n ) „aS n 



:1 ^{Xq 



-p e 



An inequality arises in the second equation when, on the right-hand side, 
we replace X~ 1 ga\x) with the larger quantity ga\x). The final inequality 
is then obtained as in (5.78). Consequently we conclude that (5.79) holds 
and hence also (5.80). □ 



Proof of Theorem 4.2. Set 

S n = S n + /(Ao) 



Vn>0, 
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and 



« d 



S = S Ti -i - ^Ti-i-i Vi>l. 
Then by Lemma 5.2, 

(5.85) E[A a ] := E[e aS ] < oo for some a > t, 

which establishes the first assertion of the theorem. 

We now turn to the remaining estimates stated in (4.16). Assume that re- 
generation occurs at time zero and let r denote the subsequent regeneration 
time. Let 

M* = sup{|B | + Aq\Bi\ + ■ ■ ■ + (4) ■ ■■Aj-i)\Bj\:Q< j < r} 
and observe that 



(5.86) E[\B\ a ] < E[(M*) a ] < E Ua 



\B Q \ + Y,e S ^\B n \l {T 



>n} 



71=1 



Now if a > 1, then by Minkowski's inequality and the independence of {B n } 
from (r, {A n }), 



E„ 



an 1/a 



(5.87) 



^|Bo| + ^ e ^ 1 l B n|l{r>n}J 

oo 



< (1 + E[|BH) 1 + E E.Je^l^^^] 1 /" < oo, 

V n=0 / 

where the final inequality holds for sufficiently small a > r, since (H2 ) yields 
E[|i?| a ] < 00, while Lemma 5.2 yields 



(5.88) 



E Ua [e aS '"l {T>n} ]<Kp- n , n = 0,l,..., 



for some constants K < 00 and p > 1 . 

On the other hand, if a < 1, then in place of Minkowski's inequality we 
may apply the deterministic inequality 

(5.89) (y + z)" 1 < y 7 + z"< Vy > 0, z > and < 7 < 1, 

and the result follows in the same way as before. 

Finally consider (4.17). Assume, to the contrary, that there exists a ex- 
positive set F G <S and a finite constant b such that 



(5.90) 



f(x)>-b and E :E [M a ]=oo Vx E F. 
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Note 

M * := sup{|5 | +A Q \B 1 \ + -- + (A --- Aj^lBjl: < j < T } 

>\B \+ A sup{B 1 + A X B 2 + • ■ • + (Ax • • ■ Aj-jBj : 1 < j < T } 
:=|S |+A Mo- 

If a > 1, then it follows by Minkowski's inequality and (5.90) that 

(5.91) E x [(M *) Q ] 1/a > -Efl^H 1 / + e- b/a E x [M^ a Vx G F. 
Then by (5.90), 

(5.92) E x [(M *) a ] = oo Vx G F. 

If a < 1, then we may apply (5.89) in place of Minkowski's inequality, and 
the conclusion still holds. 

Now suppose that regeneration occurs at time zero, and set 

T = inf {n G Z+ : X n £ F} A r. 

Since <p(F) > 0, {^n} will visit the set F over the given regeneration cycle 
with positive probability. Since E[M Q ] < oo implies v a {F) = 0, this means 
that P{T < t} > 0. Let 

N* = sup{\B T \ + A T \B T +i \ + ■ • • + (A T ■ ■ ■ Aj.^Bj \:T<j<t} 

and observe by definition that 

(5.93) M* > (A 1 ---A T -i)N*. 
Now conditional on the event £ := {T < r}, 

E £ [E £ [(Ai • • • A T -xN*) a \T, Ai, . . . , A T -i,X T ]] 

(5.94) 

= E £ p 1 ---A r -i) a E Xr [(iVT]]- 

Moreover, the left-hand side of (5.94) is finite, since (5.93) holds and we 
have shown that E[(M*) a ] < oo. However, on the right-hand side of (5.94), 
the term Ex T [(A r *) a ] is infinite due to (5.92). (In particular, note that the 
definitions of N* and Mq are the same, except that the quantities in the 
definition of N* are conditional on an initial state Xq- G F, while the quan- 
tities in the definition of Mg are conditional on an initial state Xq = x G S.) 
Consequently, we conclude 

E £ [(A 1 ---A r -i) a }=0. 

But since T < t < oo a.s. and A n > for all n, this is impossible. Hence 
E^Mq*] < oo for (p-&.&. x, and then E x [Bq ] < K x [Mq] < oo, 93-a.a. x. Setting 
B n = 1 and observing that in this case Bq > Aq, we also obtain E x [Aq] < 00, 
(f-a.a. x. □ 
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6. Some extensions. 

6.1. A general Markovian model. In this section, we consider an exten- 
sion which allows the sequence {B n } to be Markov dependent, and for there 
to be dependence between {A n } and {B n }. Now assume 

(6.1) logA n = f(X n ) and B n = G(X n ), 

where / : § — > K and G : § — > K. For simplicity, suppose for the moment that 
|-Bn| > 1 for all n, and set g(x) = logG(x). Let 

S n =logAi H hlogA n _i + log|5 n |, 

and let A be defined as A, but with S n in place of S n . In addition to (H2), 
assume that A(a) < 00 for some a = (a, (3), where a > r and /? > 0. 
An approach to this more general problem is to introduce the kernel 

(6.2) K a (x,E):= [ e aF{x ' y) P(x,dy) Vx e S, £ € 5, 

where F(x,y) = f(x) + (<?(y) — g{x)). Note from this definition that 

(6.3) K™(x,§)=E :l .[| J Bor a (^o---^n-i) Q |Sn| Q ], 

that is, K a now plays the role of P a , and similarly for the kernel &(x,E) := 
JEn(C a h) c K a {x, dy), R( e \ and so on. We note that the random quantity |Bo| 
may be bounded under an appropriate choice of suppz^ a , as in Remark 2.1. 
In this way we obtain as a direct extension of Lemma 5.2 that 



(6.4) E 



\B \+ T J2p n ~ leS "- 1 \ B n\] 

71=1 / 

which is the critical estimate needed to handle the present generalization. 
Finally, the assumption \B n \ > 1 may be dropped by replacing \B n \ with 
\B n \ + 1 in the relevant parts of the proofs. 

In practice, dependence arises in the sequence {B n } if, for example, the 
premiums of the insurance company are determined by a bonus system, in 
which case they depend on the observed claims in the previous time intervals. 
In this setting, {B n } may be modeled as a function of a Markov chain on 
K, say. See Lemaire (1995) for general background on bonus systems, or 
Bonsdorff (2005) for some recent developments. 

6.2. Generalizations of (Wl). Finally we discuss certain generalizations 
of the minorization condition (971) . A simple extension — still general enough 
to handle, for example, the AR(p) model when p > 1 — is to replace the 
Markov chain {X n } with 

(6.5) X n := (X( n _!) fc+1 , . . . ,X n k), re = 0,1,..., 
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for any positive integer k, and to introduce the following minorization con- 
dition: 

(Mi) S a lc a h(x)u a (E)<¥{Xi€E\Xo = x} Vxe§\EeS k . 

Set A n = -A(n-i)fc+l • • • A n k and assume that the analogs of (Hi)-(H3) 
hold. Then a repetition of the argument given in Lemma 5.2 yields 

E[A% + pA^At + ■■■+ p T ~ x A% ■ ■ ■ A%_y] < oo 

(6.6) 

for some a > t and p > 1, 

where r denotes a typical regeneration time of the chain {Af n }. From (6.6) 
we immediately obtain the estimate for A given in Theorem 4.2 [although 
not the estimates for B or M, since (6.6) omits terms from the sequence 
on the left-hand side of (5.63)]. To obtain the remaining estimates of that 
theorem, we could, for example, introduce the random variable 

(6-7) B n = B( n _ 1 ) fe+1 -| h A(n_^ k+1 ■ ■ ■ A nk _iB nk 

and follow the approach just outlined in the previous section. As an alter- 
native and more direct approach, we could work explicitly with the origi- 
nal sequence {B n } and with (Mi), but introduce a strengthening of (H2). 
Namely set logA n = max{log^4 n , 0} and A n = ^4( n _i)fc + i • • • A nk , and assume 
the further requirement: 

(H' 2 ) Hypothesis (H2) holds with S n := logyli + ■ • ■ + log A n in place of S, 

[A precise statement of this condition would also involve a slight change in 
the definition of h; see the discussion below, where we explicitly verify this 
condition in the AR(p) case.] A slight variant on our argument then yields 

E[A% + pA^A^ + ■■■ + P T ^A^ ■ ■ ■ A°_ 2 A^i] < 00 

(6.8) 

for some a > r and p > 1 , 

which is a mild extension of (6.6). If {B n } is i.i.d. and independent of {A n }, 
then we consequently obtain the required estimates in Theorem 4.2 for B 
and M and hence the limit results stated in our two main theorems. In 
summary, if we assume the stronger condition (H 2 ), then our main results 
hold under the alternative minorization condition (A^i). 

For example, if {X n } is an AR(p) process with p > 1, that is, 

p 

(6.9) X n = a,iX n ^i + Cn, n = l,2,..., 

i=i 

where {Cn} is an i-i-d. Gaussian sequence and Xq = x £ R, and if 
f(x) = x — p for some p > 0, 
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then (Mo) will not hold with k = 1. However, the minorization (.Mi) and 
associated regularity conditions can be verified explicitly. 

To demonstrate that our conditions actually hold in this case, we begin 
by expressing (6.9) in matrix form. Namely, for each n G Z + set 



X n -p+l t 



Cn 





V o y 



and 21: 



( a\ 
1 



\ 



1 0/ 



p.j-1 are taken to be arbitrary deterministic values). It 
aitions that 

71 = 0,1,.... 



(where Xq, . . . , X 
follows from these definitions that 

(6.10) X„ = 2tX„_ 1 +C n , 

Set Xq = Xo, X\ = X p , X2 = X2 P , — 
From (6.10) we obtain 

(6.11) X n , = 2l n X + 2l n ~ 1 Ci + --- + Cn, n = 0,l,.... 
Hence, in particular, 

(6.12) Xp = 2l p X + W p 

for some random vector W p ~ Normal(0, S), where S is a covariance matrix 
which is easily seen to have rank p. 

From (6.12) it follows that for any a > 0, there exists a finite constant b 
such that 



(6.13) 



X G B a (0) 



2l p X € B b (0), 



where B r (x) denotes a ball of radius r about x. Let & x denote the Normal(x, S) 
density function, and let 

$ i (y) = mf{<f> x (y):x£B b (0)} Vy G W. 

Note that $ 6 is positive everywhere since, for any fixed y, & x (y) is continuous 
as a function of x and hence achieves its minimum on the compact set Bb(0). 
Then by (6.13), 



(6.14) 



$b(y)dy < P(x,E) Vx G B a (0), E G S p . 



Consequently (Mi) holds with v a (dy) = c& b (y) dy and c G (0, 00) a normal- 
izing constant. 

The verification of (Hi) is likewise straightforward. For example, the rate 
function r appearing in (Hi) is just the nonzero point at which A(r) = 0, 
where A is the Gartner-Ellis limit for the process {S n }, and this limit may 
be computed, as in Example 3.2, by observing that S n is normally distributed 
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for all n, so it is sufficient to calculate the limiting values of its normalized 
mean and variance, which can be shown to converge as n —* oo. In this way 
we obtain, more explicitly, that A(a) = —am + a 2 a 2 /2 for certain positive 
constants m and a (the cumulant generating function for an appropriate 
normal distribution); thus, in particular, A'(0) < 0. 

For (H3), first recall that under an appropriate linear transformation, C, 
we have that 1— > ~ Normal(0, /), where I is the identity matrix. Let 
$ denote the Normal(0, I) density function, and fix a > 0. Then there exists 
a finite constant b such that if x £ dBb(0), then 

(6.15) $M<$x(y) yzGBa(0), y£<t 7 (x), 
where 

<t-y(x) := {loz :oj > 1, z 6 S 7 (ai)} 

is a 7-cone eminating from x and 7 > is a sufficiently small constant. Thus 
there exists a finite collection of points {x±, . . . ,xi} G dBb(0) such that 

l 

i=l 

where P denotes the transition kernel of {X n } under the transformation C 
Under the inverse transformation, we then obtain that for any given 
a > 0, there exist points {x\, . . . ,xi} lying on the boundary of some ellipse 
such that 

l 

(6.16) P(z,E)<J2P(xi,E) Vz£B a (0),VE, 

i=i 

and by a slight extension, we may replace Xi with B e (xi) (for some e > 0) 
on the right-hand side. 

The remaining condition to be verified is that (H2) holds with 

£ n :=logii + --- + logI n 

in place of S n , where log A n := max{logA n , 0}. Since log A n = f(X n ) = X n — 
fi for some [i > 0, we clearly have 

s n <\x 1 \ + --- + \x n \. 

To characterize the quantity on the right-hand side, note as a consequence 
of (6.11) that 

(6.17) |X„|<D(A"|X | + A^ 1 |Ci| + --- + |Cn|) Vn, 

where A is the spectral radius of 21 and D is a finite constant. Now under 
the standard conditions needed to ensure stationarity of the Markov chain 
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in (6.9) [cf. Brockwell and Davis (1991)], the spectral radius of 2t is less than 
one. Hence 

(6.18) 5„<D(|*o| + |Ci| + -" + lCn|) 

for some finite constant D. Since the sequence {Cn} is i.i.d., it follows that 

(6.19) E[e (a/D)ICl1 ] < oo lim sup — log E[e Q! '~' n ] < oo. 

n— >oo 71 

As (j has a standard Gaussian distribution, we conclude 

(6.20) limsupilogE[e a5 '"] < oo Va. 

n— »oo 71 

Now in a precise statement of (H2), we would also need to specify a 
function /i:8 p — >M which corresponds to the Markov chain {<-f n } rather 
than to {X n }. What is actually needed is that 

(6.21) limsup - logE[e a§pn+l3S " ) ] < 00, some a > r and /3 > 0, 

n— >oo 71 

where = /i(<-fci) + • • • + ft,(Af n ). In the above discussion, we have chosen 

h{x) = \\x\\ < \x±\ + h |xp|, where £ = (xi, . . . ,x p ). 

But then (6.21) follows from (6.20) [and its proof, since we may replace S n 
with Ya=i \X n \ in the deduction following (6.18)]. 

From these considerations, we conclude that if {X n } is an AR(p) pro- 
cess with p > 1, then the results of this paper still apply; in particular, the 
examples in Section 3 can all be considered in this more general setting. 

Finally, in the general ARMA(p, q) case, where 

v <J 

(6.22) X n = ^OjX n _j + ^2bjCn-j, 7i=l,2,..., 

i=l j=0 

it may easily be shown [cf. Meyn and Tweedie (1993), page 28] that 

(6.23) X n = b Y n + --- + b q Y n _ q , 

where {Y n } is the corresponding AR(p) process obtained by setting each bj 
in (6.22) to zero. Thus letting 

= (^n(p-l)+l j ■ • ■ > Ynp) an d A n = A( n -l)p+l ' ' ' A np , 

we see that A n = F(y n _i,y n ) (where, if necessary, we set Cj = to force 
p> q + 1). 

In this and more general situations, it is useful to consider a further 
extension of (Tl), as follows. Let {A n } again be defined as in the discussion 
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prior to (6.6) (i.e., with index "k" in place of "p"), and let {B n } be denned 
as in (6.7). Let V n = h(X n -\,X n ), and assume that 

for some function F : S 2k — > M 3 . Next introduce the following minorization 
condition: 

8 a l Cah (x)v a (E x r) < P{(A!i,^) € £ x T\Xo = x} 

(M 2 ) 

VxeS fc , EeS k , re<K 3 . 

Using a result in Ney and Nummelin [(1986), page 4] in place of Lemma 
4.1, we obtain under {M.2) that {(X n ,£n)} exhibits a regeneration structure 
and, moreover, that its transform kernel exhibits a minorization of the form 
(-A4) [where (M) is given as in the proof of Proposition 5.1]. Then one can 
proceed as before, although a rigorous analysis requires a careful treatment 
and further moment conditions on the process (A n , B n , V n ), which we do not 
pursue here. 
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